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Abstract. We obtain sharp estimates for multidimensional generalisa- 
tions of Vinogradov's mean value theorem for arbitrary translation-dilation 
invariant systems, achieving constraints on the number of variables ap- 
^vjj , proaching those conjectured to be the best possible. Several applications of 

our bounds are discussed. 
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■ 1. Introduction 

-i— > | 

The investigation of Diophantine problems of large degree is in general 
fraught with difficulties only partially mollified by the presence of intrinsic 
diagonal structure. Indeed, such analyses as are made available via the Hardy- 
Littlewood (circle) method, when successful, involve complicated exponential 
sum estimates widely considered to be amongst the most challenging in the sub- 
ject. The application of Weyl differencing by Davenport, Birch and Schmidt, 
on the one hand, involves a delicate interplay between the singular locus asso- 
; ciated with the problem and the quality of ensuing exponential sum estimates 

(see jl], [5], [T3] ). As an inescapable feature of such approaches, the number 
of variables required in a successful treatment grows exponentially with the 
degree of the problem at hand. The extension of Vinogradov's methods to 
exponential sums in many variables, on the other hand, is notoriously compli- 
cated. The work of Arkhipov, Karatsuba and Chubarikov [I], [2], for example, 
permits substantially sharper conclusions to be drawn when partial diagonal 
structure is present. However, the complexity of the underlying methods has 
deterred a consideration of all but the simplest model situations (see [2] and 
[H]). In addition, the available conclusions fail to achieve their conjectured 
potential by a factor growing roughly like the logarithm of the total degree of 
the associated translation-invariant Diophantine system. 

Our goal in this paper is to extend the efficient congruencing method intro- 
duced by the third author [21J so as to accommodate the generalised Vino- 
gradov systems of Arkhipov, Karatsuba and Chubarikov (see [I], [2]). It tran- 
spires that for systems of large degree, the bounds that we thereby derive miss 
those conjectured to hold by a factor of only 2 or thereabouts, transforming 
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the previous state of the art. Moreover, our methods are of such flexibility that 
they may be successfully applied to translation-invariant systems of wide gen- 
erality, and in particular to systems closely related to those subject to recent 
investigations by quantitative arithmetic geometers studying the Manin-Peyre 
conjectures (see [TJJ §4.15], [TH] ). Since our methods yield estimates no less 
striking for such systems, we take the opportunity to derive rather general 
estimates again coming within a constant factor of those conjectured to hold. 
There are consequences of all of this work for exponential sum estimates of 
Weyl-type, for the solubility of systems of Diophantine equations and related 
problems, and for certain problems in additive combinatorics, and these we 
also explore herein. 

Rather than encumber the reader at this point with the substantial nota- 
tional prerequisites entailed by a discussion of our most general conclusions, 
we instead offer the more easily digestible corollaries particular to the model 
problem considered in earlier work [9J of the first author. Let s, k and d be 
natural numbers, and let X be a positive number. We focus attention on the 
system of simultaneous Diophantine equations 

s s 

J2x%x%---x% = j2y>%---y? d + (1.1) 

3=1 3=1 

Here, the indices i m are non-negative integers, so that a modest computation 
reveals the total number of equations in the system ( II. ip to be r = r^k, where 

fk + d\ i v 

U, = („)-!■ ("J 

Meanwhile, the total degree of the system ( II. ip . which is to say the sum of the 
degrees of the equations comprising the system, is equal to K = Kd t k, where 

i=i v 7 

In particular, in the familiar classical Vinogradov system with d — 1, one has 
r = k and K = ^k(k + l). Finally, we write J S) k,d(X) for the number of integral 
solutions of the system (II. ip with 1 ^ Xj m , yj m ^ X (1 ^ j ^ s, 1 ^ m ^ d). 

In ^21 as a special case of Theorem I2.1[ we derive an estimate for J s ,k,d{X) 
that is in many respects close to the best possible. Here and throughout, 
implicit constants in Vinogradov's notation <C and 3> depend at most on s, 
k, d and e, unless otherwise indicated. The letter X should be interpreted as 
a positive number sufficiently large in terms of s, k, d and e. 

Theorem 1.1. Suppose that s, k and d are natural numbers with k ^ 2 and 
s^r(k + l). Then for each e > 0, one has J s ,k,d( X ) < X 2sd - K+e . 

The special case of Theorem 11.11 with d — 1 is equivalent to [2U Theorem 
1.1], a conclusion which has very recently been sharpened in [2"2"j Theorem 1.1], 
so that for k ^ 3 and s ^ k 2 — 1, one has 

J., k ,i(X) < x 2s -^ k+l)+e . 
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When d > 1, meanwhile, one may compare the conclusion of Theorem 1 1 . 1 1 wit h 
work of Arkhipov, Karatsuba and Chubarikov |2j. Although set up slightly 
differently, it is clear that the methods of the latter authors have the potential 
to establish a bound of the shape 

J sAd (X) « X 2sd ~ K+A % 

where A s decays with s roughly like rke~ s ^ rk \ In 0, Theorem 1.1] this con- 
clusion was improved by the first author for sufficiently large values of k, so 
that on writing 

So = \rk(\ogk — 2 log log k), 

one may take 

(rke 2 ~ 2s ^ rk \ for 1 ^ s ^ s , 

s ~ 1 r(log k) 2 e 3 - 3 ( s - so ^( 2rk \ for s > s . 



The estimate supplied by Theorem II .11 is substantially sharper. Thus, provided 
only that s ^ r(k + 1), one may take A s = e for any positive number e. 
The number of variables required in typical applications, as we discuss in due 
course, is thereby reduced by a factor of order log(rfc). 

In order to discern the strength of the estimate supplied by Theorem II. 1[ we 
must consider available lower bounds for the mean value J s ^ t d{X), and thereby 
infer plausible conjectures for corresponding upper bounds. In §[3] we establish 
the lower bound for J s ,k,d{X) contained in the following theorem. 

Theorem 1.2. Suppose that s, k and d are natural numbers. Then one has 

d 

J s ,kA x ) > xsd + J2 x{2s ~ 1)l+d ~ K3 ' k - 

It seems reasonable to conjecture that whenever e > 0, one has the allied 
upper bound 

d 

J s ,k,d( x ) < A £ (x sd + J2x {2s - 1)l+d - K ^ A . 

i=i 

Thus, when s is sufficiently large in terms of k and d, one expects that 

J s , k4 {X) « X 2sd ~ K+ \ (1.4) 

as is confirmed by Theorem 11.11 for s is r{k + 1). We emphasise that here, 
and throughout the introduction, we abbreviate r^k to r and K^k to K. As 
a consequence of Theorem 11.21 we show in Theorem 13.21 that when 5 is a real 
number with 2d/k < 5 < 1, and 

s <25T2< 1 -^ + 1 )- 

then there is a positive number rj = t](d, k) with the property that 

J s>k , d (x) » x 2sd - K+ \ 
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When d ^ 2, therefore, it follows that the conclusion of Theorem 11.11 comes 
within a factor 2 + 2/ d+0(d/k) of the least value of s for which the conjectured 
upper bound (II. 4p might conceivably hold. In such multidimensional Weyl 
sums, a near-optimal conclusion of this type, merely a constant factor away 
from the best possible, has hitherto been wholly beyond our grasp. 

We next consider upper bounds for exponential sums of Weyl-type, the dis- 
cussion of which is much facilitated by the introduction of additional notation. 
It is convenient to abbreviate a monomial of the shape to x 1 , in 

which i = (21,22, • • • iidj- Likewise, we may write x^ for x^x 1 ^ ■ ■ ■ x l ^ d . In 
such circumstances, we put 

|i| = i x + . . . + i d . 

Also, in place of the s-tuple (x 1; . . . , x s ) we write x, and we adopt the conven- 
tion that a ^ v ^ b is to mean that each coordinate vi of the vector v satisfies 
a ^ Vi ^ b. Equipped with these conventions, the Diophantine system (11. ip 
assumes the compact shape 

x'+...+X* =yi + ...+yi 

and J s ,k,d{X) counts the number of integral solutions of this system with 1 ^ 
x,y^X. 

Define the exponential sum f{cx) = f d ^(cx;X) by 

f d)k (a;X) = ^ e OKx;aO), 

where 

■0(x; a) = atiX 1 , 

ls£|i|^fc 

and, as usual, we write e(z) for e 2mz . Here, the subscript d that identifies x as 
the (i-tuple (xi, . . . , Xd) may usually be omitted without leading to confusion. 
As we have already noted, the number of coefficients ot\ is r. We adopt the 
convention that, when G : [0, l) n — > C is measurable, then 

/G(/3)d/3= / G((3)d(3. 

It then follows from orthogonality that 

J S)M (X) = £\U k ( a ;X)\ 2s da. (1.5) 

Upper bounds for mean values of exponential sums such as /(ck) may be 
converted into Weyl-type estimates by means of variants of the large sieve 
inequality. Before announcing such an estimate, which is a consequence of the 
more general result recorded in Theorem 110.31 we pause to record a further 
notational convention. When a G Z™, we write (q, a) for the greatest common 
divisor (q, a±, . . . , a n ). 
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Theorem 1.3. Suppose that d and k are natural numbers with k ^ 2. Let a 
be any real number with 

a' 1 > ^2k(^ k + d d ~ 1 ^-2k + l ) j (d + 1). 

Then whenever \ f djk (ct; X)\ ^ X d ~ a+£ , for some e > 0, it follows that there 
exist gGN and aj G Z (1 ^ |j| ^ k) satisfying 

(q, a) = 1, 1 ^ q ^ X ka and \qa 3 — aj| < X fe<T ~ |j| (1 < |j| < fc). 

The special case of Theorem 1 1 . 3 1 wit h d = 1 is very slightly weaker than [211 
Theorem 1.6], a conclusion which has recently been sharpened in [221 Theorem 
11.2]. Thus, when d = 1, the conclusion of Theorem 11.31 holds whenever k ^ 4 
and <7~ * ^ 4A;(A; — 2). The work of Arkhipov, Karatsuba and Chubarikov [2], 
as interpreted and sharpened by the first author, yields a conclusion similar 
to Theorem 11.31 Indeed, it follows from a corrected version^ of [9, Theorem 
1.2] that a conclusion of similar form holds for sufficiently large values of k, 
though with the constraint on the exponent a replaced by a condition of the 
shape a -1 ^ f(^+ l)rk\og(rk). On noting that ( fc+ ^ 1 ) — 1 = r^k-i ^ ?"d,jfc, 
the superiority of our new bound is clear. 

We next consider the application of our new estimates to Diophantine prob- 
lems. When s, k and d are natural numbers, and a\j is a non-zero integer for 
1 ^ |i| ^ k and 1 ^ j ^ s, write 

s 

0i(x) = ^a iiX } 

In §fTl~l we consider the Diophantine system 

0i(x) = O (l^|i|<Jfe), (1.6) 

consisting of r equations of total degree K. Let N(B) denote the number of 
integral solutions of the system (11.61) with |x| ^ B. We follow Schmidt JT3] 
when defining the (formal) real and p-adic densities associated with the system 
(jD)jl . When L > 0, define 

L(l — L|7/|), when \r]\ ^ 
0, otherwise, 



and put 



The limit = lim when it exists, is called the real density. Meanwhile, 
given a natural number g, we write 

M{q) = card {x G (Z/qZ) sd : 0,(x) = (mod g) (1 < |i| < Jfe)} . 



1 See the discussion following the proof of Theorem 110.21 below for an explanation of the 
need for a modest correction in [51 Theorem 1.2]. 
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For each prime number p, we then put 

a p = lim p h(r - sd) M(p h ), 

h— >oo 

provided that this limit exists, and refer to a p as the p-adic density. 

As a special case of Theorem 111.11 we establish an asymptotic formula for 
N(B) valid whenever s ^ 2r(k + 1) + 1. 

Theorem 1.4. Suppose that s, k and d are natural numbers with k ^ 2 and 
s ^ 2r(k + 1) + 1. In addition, let a\j (1 ^ |i| ^ k, 1 ^ j ^ s) be non- 
zero integers. Then provided that the system of equations U.6]) possesses non- 
singular real and p-adic solutions for each prime number p, one has 

N(B) ~ ffoo (j[ a p j B sd ~ K . (1.7) 

In particular, the system U.6}) satisfies the Hasse Principle. 

We note that [211 Theorem 9.1] delivers the same conclusion as Theorem 
11.41 in the special case d — 1. As is apparent from the lower bound supplied 
by Theorem 11.21 and the ensuing discussion, there exist choices of coefficients 
a for which the asymptotic formula (jl.7p necessarily fails when k is large and 

s^-^j(l + 0(d/k))r(k + l). (1.8) 

Consequently, the bound on the number of variables in the hypotheses of 
Theorem 11.41 is within a factor 2 + 2/d + 0(d/k) of the best possible bound 
for such systems. Indeed, the argument underlying the proof of Theorem 
11.21 shows that such remains true in wider generality. The point here is that 
special subvarieties contain the bulk of the set of integral solutions whenever 
the bound (11.81) holds on the number s of blocks of d variables. Conclusions 
available hitherto of the type presented in Theorem 11.41 impose bounds on the 
number of blocks of variables weaker than our own by a factor of order log(rfc). 

As a special case of Corollary lll.2[ we obtain an asymptotic formula for the 
mean value J Sj k,d{X). 

Theorem 1.5. Let k and d be natural numbers with k ^ 2. Then whenever 
s ^ r(k + 1) + 1, there exist positive constants = ^(s, k, d) and 5 = 5(k, d) 
such that 

j s>k ,d{x) = ^x 2sd ~ K + o{x 2sd - R - 5 ). 

A conclusion analogous to that of Theorem 11.51 is obtained in 0, Theorem 
1.3], subject to the condition that k be sufficiently large and 

s ^ r&(| logr + | log k + log log A; + 2d + 4). 

Again, the conclusion of Theorem 1 1.5 1 is much superior. When d — 1 and k ^ 3, 
meanwhile, the conclusion of Theorem 11.51 is a consequence of (211 Theorem 
1.2]. 
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As a penultimate application of the bounds supplied by Theorem in £ fl~T1 
we consider the rational linear spaces of projective dimension d — 1 lying on 
the diagonal hypersurface 

Cl z\ + ... + c s z k s = 0, (1.9) 

with Ci, . . . , c s fixed non-zero integers. Such a linear space may be written in 
the form 

j£?(xi, . . . , x rf ) = {tixi + . . . + t d it d :ti,...,t d e Q}, 

for suitable linearly independent vectors x 1; . . . ,x d 6 If . As noted in [9J, by 
substituting into (11.91) and using the multinomial theorem to collect together 
coefficients of t 1 , one finds that the linear space J?f (x) corresponds to a solution 
yi; • • • i Ys £ 2^ of the Diophantine system 

c iy \ + ... + c s yl = (\i\ = k). (1.10) 

This correspondence is made explicit by means of the simple relation 

x i:j = yji (1 ^ i < d, 1 < j < s). 

Write N S) k,d(X) for the number of integral solutions of the system fll.lOp with 
|y| ^ X, and put 

In §11 we indicate how to prove an asymptotic formula for N s ^ td (X) subject 
to the condition that s ^ 2r(k + 1) + 1. In this context, we say that the integral 
s-tuple c is a non-singular choice of coefficients for k and d when the system 
of equations (11.101) has non-singular real and p-adic solutions, for every prime 
number p. 

Theorem 1.6. Suppose that s, k and d are natural numbers with k ^ 2 and 
s ^ Irik + 1) + 1. Suppose further that c 6 (Z\ {0}) s is a non-singular choice 
of coefficients for k and d. Then there exist positive constants *3) = @(s, k, d; c) 
and v = u(k, d) such that 

N s , k , d (X) = $X sd ~ L + 0{X sd - L - u ). 

In particular, one finds that whenever s ^ 2r(A; + 1) + 1 and appropriate local 
solubility conditions are met, then the hypersurface defined by (II. 9p contains 
an abundance of rational linear spaces of projective dimension d — 1. A perusal 
of [9 J reveals that, for sufficiently large values of k, a similar conclusion is 
asserted by Theorem 11.41 of the latter source, subject instead to the more 
stringent condition 

s ^ r/c(|logr + log/c + 2 log log k + Ad + 8). 

We remark that the lower bound s ^ 2r(k + 1) + 1 in Theorem 11.61 should be 
susceptible to some small improvement by adapting the methods of [20] to the 
present multidimensional setting. Moreover, when the degree k is very small, 
an approach of the first author [10] motivated by a method of Hua proves 
superior in some situations. 
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Further applications within the orbit of our methods and bounds include 
the generalised Waring problem of representing a given polynomial 

t d ) e Z[t] 

in the form 

s 

$ (*) = X^* 1 + ■■■+ x djt d )\ 

and also results concerning the number of integral solutions of Diophantine 
inequalities modulo 1. We refer the reader to [2] for a discussion of some 
such problems, and leave to the reader the satisfaction of incorporating our 
new bounds into the established methods so as to make similarly striking 
improvements over the previous state of knowledge. 

As a final application of our new bounds for multidimensional Weyl sums, 
we announce an application in additive combinatorics based on the second 
author's recent work [TT] on translation invariant systems of equations devoid 
of solutions in multidimensional sets. In Theorem 111.31 below we present a 
conclusion more general than the one we presently record in Theorem 11.71 For 
the purpose at hand, we describe the integral s-tuple c as an extended non- 
singular choice of coefficients for k and d when (i) one has Cx + . . . + c s — 0, 
and (ii) the system of equations 

ciyl + ... + c s y^ = (l.H) 

has non-singular real and p-adic solutions, for every prime number p. 

Certain solutions of the system (II. lip are atypically simple to obtain, such 
as the trivial solutions lying on the diagonal yi = y2 = • • • = y s - We formalise 
this notion by distinguishing two types of special solutions y of (11. lip . We 
describe y as projected when there is a translate of a proper subspace of Q d 
that contains all of yi,...,y s . The aforementioned diagonal solutions are 
therefore projected, since they lie in a translate of the trivial subspace {0} of 
Q d . Also, we say that y is a subset-sum solution when there exists a partition 
{1,2, ... ,s} = ^\ U . . . U ^i, into / ^ 2 disjoint non-empty sets J? v , such that 
for 1 ^ v ^ I one has 

]T c u yl = (1 < |i| < k). 
In the special case in which 

Cu = (1 < u < 0, 

one sees that there are trivial subset-sum solutions in which y u = y w whenever 
we /„ and we ^ v (1 < v < I). 

Theorem 1.7. Suppose that s, k and d are natural numbers with k ^ 2 and 
s ^ 2r(k + 1) + 1. Suppose further that c £ (Z \ {0}) s is an extended non- 
singular choice of coefficients for k and d, so that c\ + . . . + c s = 0. Let stf 
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be a subset of Z* d R [1, N] d , and suppose that the only solutions of the system 
M.ll\) from .stf are either projected or subset-sum solutions. Then one has 

card(^) < iV^loglogiV)- 1 ^- 1 ). 

This theorem is a higher dimensional cousin of [TTj Theorem 5.1], which 
supplies an analogous conclusion for a case involving binary forms. Theorem 
11.71 shows that when card(.e/) grows more rapidly than iV d (loglog iV) -1 /^ -1 ), 
then the system (II. lip contains solutions from stf besides such obvious ones 
as the diagonal solutions with yi = . . . = y s . As we see in §3, the extended 
system (11.111) contains more general special subvarieties defined by means of a 
projection process, the simplest of which set one or more variables to be zero. 
In Theorem 111.31 we present a conclusion that refines Theorem 11.71 in which, 
under the same hypotheses concerning the cardinality of one finds that 
the system (II. lip contains solutions from stf which avoid all of these special 
subvarieties. In this way, one may legitimately describe the solutions of (II. lip 
thus shown to exist as honestly non-trivial. The interested reader will find the 
necessary ideas in earlier work [11] of the second author. 

It may be useful to provide an informal sketch hinting at the argument un- 
derlying the proof of Theorem 11.11 so that the reader is better prepared to 
draw parallels with previous approaches. A more comprehensively illuminated 
sketch of this argument in the case d = 1 may be found in [21, §2]. In common 
with the previous approaches of [2] and [9], the basic tool employed in our 
proof of Theorem II. II is a (so-called) p-adic iteration mirroring the one devised 
by Linnik [5] in the classical setting with d — 1. Thus, we begin by artifi- 
cially introducing a congruence condition, modulo a suitable prime p, amongst 
the bulk of the variables underlying the mean value (11.51) . An application of 
Holder's inequality leads to a new mean value in which the latter variables 
lie in common congruence classes across blocks. At this point, the multiple 
translation invariance of the system (11.11) may be utilised so as to pass to the 
zero congruence class, and thereby a congruence condition is forced on a subset 
of the variables of greater strength than that previously introduced. The ap- 
proach of [2] is to choose the prime p in such a way that this strong congruence 
condition forces a diagonal condition amongst blocks of variables, and thereby 
one is able to bound a mean value involving 2(s + r) blocks of d variables in 
terms of a corresponding mean value involving 2s blocks of d variables. In [9] 
the strong congruence condition is interpreted as a differencing process analo- 
gous to, though more efficient than, that of Weyl. By appropriate use of the 
Cauchy-Schwarz inequalities, one is able to repeat this efficient differencing 
process, deferring the moment at which to force the diagonal condition. In 
the present paper, following [2TJ, we instead interpret the strong congruence 
condition as an efficient method of imposing a second artificial congruence 
condition amongst variables. By appropriate application of Holder's inequal- 
ity, one recovers a new mean value resembling that obtained in the first step, 
but now yielding a fresh congruence condition amongst variables significantly 
stronger than before. If one begins with a mean value significantly larger in 
size than anticipated, then repeated application of this efficient congruencing 
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procedure yields a related mean value larger in size than that anticipated by an 
amount so large that even a trivial estimate demonstrates the presumed initial 
deviation from the expected size to be untenable. In this way, one shows that 
the mean value under consideration has size very close to that expected. 

We finish by emphasising that the methods of this paper are robust to 
changes of the ambient ring. Thus the rational integers Z central to this paper 
may be replaced with the ring of integers from a number field, or the polynomial 
ring ¥ q [t], without diminishing the strength of the ensuing estimates. Such 
ideas have been explored very recently in the case d — 1 in work emerging 
from the body of research exploiting the efficient congruencing method (see [7j 
and [23]). 

In §2] we introduce the general translation-dilation invariant systems which 
constitute the central objects of attention in this paper. Then, in ^J3j we discuss 
the lower bounds recorded in Theorem 11.21 The notation and infrastructure 
required for our most general conclusions is discussed in §|H and then in §S] we 
derive the basic mean value estimates which initiate our efficient congruencing 
argument. Next, in §j6j we provide estimates for the number of solutions of 
a system of basic congruences. Here, the singular locus of the system is of 
particular concern. The conditioning process, required to guarantee appropri- 
ate non-singularity conditions, is engineered in £J7J and in £JH] we discuss the 
efficient congruencing process itself. In §9] we combine the output of §^7]and[8] 
so as to deliver Theorem 11.11 via an iterative process. Consequences for Weyl- 
type estimates are discussed in ^TU| yielding the conclusion of Theorem 11.31 
Finally, in §111 we sketch the arguments required to establish the Diophantine 
consequences recorded in Theorems I1.4H1.7I 



In order to describe our most general conclusions, we must introduce some 
notation having flexibility sufficient for our needs. An overly prescriptive ap- 
proach has the potential to shroud the details of our arguments in a thick 
blanket of impenetrable symbols. With this undesirable potential outcome in 
mind, we opt for a somewhat abstract approach, and only later do we spend 
time detailing the most interesting situations. 

Let r, s and d be natural numbers, and consider a system of homogeneous 
polynomials F = (F 1 , . . . , F r ), where Fj(z) e %[zx, . . . , z d \ (1 ^ j ^ r). We 
investigate the system of Diophantine equations 



in which x, = (x a , . . . ,x id ) and = (y a , . . . ,y id ) for 1 ^ i ^ s. Note 
that, in view of our conventions concerning vector notation, the system (12.1 p 
consists of r simultaneous Diophantine equations. Write x = (xi, . . . , x s ) and 
y = (yi, . . . , y s ), and denote by J S (X; F) the number of integral solutions of 
the system with 1 «C x,y < X. 



2. Translation-dilation invariant systems 





8=1 
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In this paper we are concerned with translation-dilation invariant systems 
of the shape (12.11) . With the discussion to come in mind, we take a pragmatic 
approach to defining such systems. We say that the system F = (Fx, . . . , F r ) 
is translation- dilation invariant if: 

(i) the polynomials Fx, . . . , F r are each homogeneous of positive degree, and 

(ii) there exist polynomials 

Cjl G Z[£i, ...,£*] (1 ^ j ^ r and ^ i< j), 
with Cjj = 1 for 1 $C j ^ r, having the property that whenever £ G 7L d , then 

i 

F J (x + ^) = c j0 (O + ^c :;7 (O^(x) (Kj<r). (2.2) 

l=i 

Extend the definition of the coefficients cji by putting Cji(£) = when I > j. 
Then on writing c (£) = ( c jo(^))i^Xr and C(£) for the matrix (cji($))x^j,i^ r , 
we see that the relations (12.21) are summarised by the formula 

F(x + = C(£)F(x) + c Q (£). (2.3) 

Notice that the matrix C(£) is lower unitriangular, which is to say that it 
is a lower triangular matrix whose main diagonal entries are all 1. Suppose 
that s is a natural number, that A is a non-zero rational number, and £ G Z d . 
Then we see from (12. 2p that the Diophantine system (12. ip possesses an integral 
solution x, y if and only if one has 



(F(Ax, + £)-F(Ay j+ £))=0. (2.4) 

1=1 

This observation justifies the description of such systems of equations as trans- 
lation-dilation invariant. We should note that while this formal definition fa- 
cilitates many of our arguments, it is clear that one may rearrange the ordering 
of the forms, and also consider independent linear combinations of the original 
forms, without altering the number of integral solutions of the system (12. ip 
counted by J S (X;F). Thus we may be expedient in most circumstances, and 
instead describe a system as translation-dilation invariant when it is equivalent 
in such a manner to some new system which is translation-dilation invariant 
in the strict sense. 

We emphasise that translation-dilation invariant systems are easily gener- 
ated. Given a collection of homogeneous polynomials 

Gx, ■ ■ ■ , Gh G Z[zi, . . . , Zd], 

consider the set & consisting of all the partial derivatives 

[l^j^h), (2.5) 



d£ ...dz l d 

with li ^ (1 ^ i ^ d). Plainly, when lx + ■ ■ ■ + Id exceeds the largest total 
degree of any of the polynomials Gj, this partial derivative vanishes. The 
set is consequently finite. Let J?o denote the subset of & consisting of 
all polynomials in & having positive degree. We write J^o = {-^i, • • • ,F r }, 
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labelling the elements in such a way that degFi ^ degF 2 ^ . . . ^ degF r . 
An application of the multidimensional version of Taylor's theorem now shows 
that the relations (12. 2p hold for some choice of coefficients Cji(£) G Z[£i, ■ ■ ■ , Cd] 
satisfying Cjj(£) = 1 (1 ^ j ^ r). Since we may replace the set of forms JF by 
any subset whose span contains the polynomials Fx, ... , F r , there is no loss of 
generality in supposing the set {Fi, . . . , F r } to be linearly independent. Such 
a system of forms we call reduced. 

Finally, by replacing the forms Fx, . . . , F r by appropriate linear combinations 
of the original forms, we find that there is no loss of generality also in supposing 
that the matrix C(£) with entries Cji($,) is lower unitriangular. This new system 
F = (Fi, . . . , F r ), generated from the partial derivatives (12.51) . is a reduced 
translation-dilation invariant system. 

We are now almost equipped to state our main theorem, but first pause 
to introduce some parameters associated with a translation-dilation invariant 
system of polynomials F. When F = (Fx, . . . , F r ) consists of polynomials 
Fj(z) G Z[^i, . . . ,z d ], we refer to the number of variables d = d(F) in F as 
the dimension of the system. In addition, we describe the number of forms 
r = r(F) comprising F as the rank of the system. We write kj = kj(F) for the 
total degree of the polynomial Fj, and then define the degree k = k(F) of the 
system F by 

fc(F) = max%(F), 

and the weight K = K(F) by 

jr(F) = £>(F). 

i=i 

Our goal in §§4)19] is the proof of the following mean value estimate, which 
represents the main theorem of this paper. 

Theorem 2.1. Let F be a reduced translation- dilation invariant system of 
polynomials having dimension d, rank r, degree k and weight K . Suppose that 
s is a natural number with s ^ r(k + 1). Then for each e > 0, one has 
J S (X;F) < x 2sd - K+£ . 

So far as we are aware, no mean value estimate available in the literature has 
generality to compete with Theorem 12 .![ Moreover, when d ^ 2 the estimates 
available hitherto are considerably weaker, even in the special situations in 
which they are applicable. In order to illustrate the ease with which estimates 
may be extracted from Theorem 12.11 we finish this section with a brief discus- 
sion of some simple cases, and in particular we show how to establish Theorem 
11.11 as a consequence of Theorem 12.11 

(a) The classical system of Vinogradov [IB] . [T7] . Consider the seed polynomial 
z k (k ^ 1). By taking successive derivatives, we find that an associated reduced 
translation-dilation invariant system of polynomials is F = (z k , z k ~ l , . . . , z). 
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This system has dimension 1, rank k, degree k and weight 

k 
3=1 

Then it follows from Theorem 12.11 that when s ^ kik + 1), one has 

J S (X;F) ^ X 2s ~^ k+ ^ +£ . 

This estimate recovers the main conclusion of the third author's recent work 
introducing the efficient congruencing method to Vinogradov's mean value 
theorem (see [211 Theorem 1.1]). We note that subsequent work of the third 
author leads to the improved constraint s ^ k 2 — 1 on the number of variables 
in this conclusion (see [221 Theorem 1.1]). 

(b) The system of Parsell [9J. Consider the situation with d ^ 2 and seed 
polynomials z\z^ ■ ■ ■ z l £ (|1| = k). By taking successive partial derivatives, 
we find that an associated reduced translation-dilation invariant system of 
polynomials is 

F = (z^z% ...z^ : 1 ^ |i| < k). 
This system has dimension d, rank 




degree k and weight 

1=1 \i\=l 1=1 \ / 

In this instance, it follows from Theorem 12.11 that when s ^ r(k + 1), one has 
J S (X] F) < x 2sd - K+£ . In view of ([LSD and (Q, this completes the proof of 
Theorem 11.11 

(c) The system of Arkhipov, Karatsuba and Chubarikov [2]. Consider the sit- 
uation with d ^ 2 and I ^ 1 and the seed polynomial z\z\ . . . z\. By taking 
successive partial derivatives, we find that an associated reduced translation- 
dilation invariant system of polynomials is 

F = {z\ l z% . . . 4 d : ^ i ^ I, i ^ 0). (2.8) 

This system has dimension d, rank 

r= £ 1= £ ... £ l-l = (/ + l) d -l, (2.9) 

(Ki<2 O^h^l O&d^l 

i^O 
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degree dl, and weight 

*=£n = £ ••• £fr + -+*) 

O^i^i O^ii^Z 0^i d ^l 

= d £ • • • £ Zd 

= d(l + l)^ 1 • i/(Z + 1) = |dZ(i + l) d . (2.10) 

In this instance, Theorem 12.11 delivers a conclusion important enough to sum- 
marise as a corollary. 

Corollary 2.2. Let d and I be natural numbers, and let F be the reduced 
translation- dilation invariant system given by Ii2.8\) . Suppose that s is a natural 
number with s ^ (dl + 1)((/ + l) d — 1). Then for each e > 0, one has 

J S {X;F) < x 2sd - K+£ , 

where K = \dl{l + l) d . 

A conclusion similar to that provided by Corollary 12.21 but with the condi- 
tion (dl + 1)((7 + l) d - 1) replaced by 

s ^ Cdl{l + l) d log{dl{l + l) d ), 

for a suitable positive constant C, may be extracted from [21 Theorem 1 of 
Chapter III. 1] . The superiority of our new bound is self-evident. 

(d) Simple binary systems. A system of relevance to recent work in quantitative 
arithmetic geometry (see [HJ §4.15],[15]) deserves to be singled out for special 
attention. Consider the situation with k\ ^ k 2 1 and the seed polynomial 
z^z^ 2 . By taking successive partial derivatives, we find that an associated 
reduced translation-dilation invariant system of polynomials is 

F = (zi'zi? : ^ it < ki, < i 2 < k 2 and (i u i 2 ) ^ (0, 0)). (2.11) 

This system has dimension 2, rank 

r= £ E l-l = (*i + l)(fe + l)-l, 
degree k\ + k 2 , and weight 

K = E E 

= (An + 1) • f & 2 (A; 2 + 1) + (A; 2 + 1) • {k^h + 1) 

= |(A;i + k 2 ){k l + l)(/c 2 + 1). 

By applying Theorem 12.11 we obtain the following corollary. 

Corollary 2.3. Let k\,k 2 € N, and let F be the reduced translation- dilation 
invariant system given by \2.11\) . Suppose that s is a natural number with 
s ^ (k\k 2 + k\ + k 2 ){ki + k 2 + 1). Then for each e > 0, one has 

J S (X;F) <^X 4s - K+£ , 

where K = \(k x + fc 2 )(/ci + l)(k 2 + 1). 
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(e) The binary systems of Prendiville [TT]. Consider the situation with k ^ 1 
and the seed polynomial given by the binary form $(zx, z-i) G Z[z±, z 2 ] of degree 
k. In this instance, we extract the partial derivatives 

d h+i2 <$>(z l ,z 2 ) , 

dz^dz'i V ; 

and restrict attention to any subset which spans the set of all partial derivatives 
of positive degree, yet is linearly independent over Q. We take the polynomials 
in this spanning set to be our reduced translation-dilation invariant system F. 
The number of partial derivatives with %\ + i 2 = I is plainly I + 1, while the 
number of monomials z^z^ with ji + 32 — m is m + 1. Thus we see that this 
system has dimension d = 2, rank 

[k/2] fc-[fc/2]-l 
1=0 m=l 

degree k and weight 

[k/2] fc-[fc/2]-l 

tf<^T(Jfe-Z)(Z + l) + ^ m{m + l) ^\k(k + 2f. 

1=0 m=l 

In typical situations, indeed, one has K ~ |fc 3 . By applying Theorem 12.11 we 
deduce that when s ^ \k{k + l)(fc + 4), one has J S {X; F) < x 4s "^ +£ . This 
conclusion may be compared with the mean value estimate underlying [Tlj 
Theorem 1.3], which delivers a similar conclusion for s ^ (| + o(l))fc 3 log A;. 
The constraint on s imposed in our present work is therefore stronger by a 
factor (| + o(l)) \ogk. 



3. Lower bounds 

In order to put into perspective the upper bounds recorded in Theorem I2.1[ 
and such corollaries as Theorem 11.11 we consider in this section the topic of 
lower bounds for the mean value J S (X;F). Here one must consider integral 
solutions to the system of equations fl2.ll) of two types. On the one hand, there 
are typical solutions whose contribution to J S (X; F) we expect to be given by 
a product of local densities. On the other hand, there are integral solutions 
lying on special subvarieties, the most obvious of which are diagonal linear 
spaces such as that given by Xj = Vj (1 ^ i ^ s). It transpires that when 
d > 1, there are special subvarieties not of the latter type which potentially 
make the dominant contribution to J S (X;F). In order to describe the latter 
subvarieties, we must introduce some further notation. 

Let r and d be natural numbers, and consider a system of translation-dilation 
invariant polynomials F = (Fi, . . . , F r ), where Fj(z) £ Z[zi, . . . , z^] (1 ^ j ^ 
r). Let 5 be a natural number with 1 ^ 8 ^ d — 1, and consider indices i\ 
(1 ^ I ^ 6) satisfying 



1 ^ i\ < %2 < ■ ■ ■ < is ^ d. 



(3.1) 
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We say that the system of polynomials G = (G±, . . . , G r ), where Gj(w) E 
Z[w\, . . . ,ws] (1 ^ j ^ r), is the orthogonal projection of F determined by i 
when 

G J (w) = F J (C) (Kj^r), 
in which ( m = wi when m = i\ for some index I with 1 ^ I ^ 8, and ( m = 
when m ^ {ii, . . . , z^}. The system G remains translation-dilation invariant, 
and may be replaced by an equivalent reduced system G'. We describe G' as 
a reduced orthogonal projection ofF determined by i. Finally, write ^(F) for 
the set of all reduced orthogonal projections of F determined by sets of indices 
{«!,... ,is} satisfying fl3.ll) . We remark that these orthogonal projections are 
in fact a special case of the more general projections introduced in the preamble 
to Theorem 11.71 In this section we consider only the former projections, since 
they are simpler to analyse and in any case deliver all of the salient features 
of importance for our discussion of lower bounds. 

In order to facilitate our subsequent discussion, we define the polynomial 
i/j(x] ct) = i/j(x] ct; F) by putting 

r 

V(x;a;F) = ^a i F i (x), (3.2) 

i=l 

and then define the associated exponential sum f(ct) = f(ct; X; F) by 

/(a;X;F)= ]T e(V(x;a;F)). (3.3) 

By orthogonality, we then have 

J S {X;F)= hf(cx;X;F)\ 2s dcx. (3.4) 

We are now equipped to describe our most general lower bound for the mean 
value J S (X; F). 

Theorem 3.1. Let F be a reduced translation-dilation invariant system of 
polynomials having dimension d and weight K . Then for each natural number 
s, one has 

d-l 

J s (X; F) > X sd + X 2sd ~ K + V X d ~ 5 max J s (X; G) . 

«=1 GGJT 4 (P) 

Proof. We consider first typical solutions of the system (12.11) not constrained 
to lie on special subvarieties. Suppose that F has rank r, and write kj for the 
degree of Fj for 1 ^ j ^ r. There exists a positive number A, depending at 
most on d, k and the coefficients of F, such that whenever 1 ^ x ^ X, one has 

\Fj(x)\ ^ AX k i (l^j^r). 

Consequently, when 1 ^ x, y ^ X, one sees that for 1 ^ j ^ r the integer 

s 

i=i 
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lies in the interval [—2sAX kj ,2sAX kj ]. We therefore deduce by means of 
orthogonality in combination with the triangle inequality and (13. 4p that 

[X] 2sd = f |/(«;X;F)| 2s e(-ai/ii - ... - a r h r ) da 

\hj\^2sAX k j 
(l<i<r) 

« ( J] X k ^ (f\f( a ;X;F)\ 2s da = X K J s (X;F). 
Thus we conclude that 

J S (X;F) > X 2sd ~ K . (3.5) 

Next, by considering the diagonal solutions of (12.11) with 1 ^ x, y ^ X and 
x i = yj (1 ^ j ^ s), we obtain the lower bound 

J S (X;F) >X sd (3.6) 

We now come to consider the solutions of the system (12.11) lying on certain 
additional special subvarieties. We assert that when G is a reduced translation- 
dilation invariant system with dimension e ^ 2, then for every system H lying 
in 7r e _i(G), one has 

J S (X;G) >XJ S (X;H). (3.7) 

In view of (13.51) and (I3.6p . the lower bound claimed in the statement of Theorem 
13. II then follows by induction. 

Let G = (Gi, . . . , G u ) be a reduced translation-dilation invariant system of 
dimension e > 1, and consider a system H G 7r e _i(G). The system H is the 
orthogonal projection of G determined by some (e — l)-tuple i = (zi, . . . , i e -i)- 
By relabelling variables, if necessary, we may suppose that i = (1, 2, . . . , e — 1). 
Consider now the system of equations 

s 

£(G(xO " G(y*)) = 0. (3.8) 

i=l 

Let a be an integer with 1 ^ a ^ X, set a = (0, . . . , 0, a), and consider the 
effect of the translation (xj, yj) i — > (xj — a, yj — a). In view of the translation 
invariance of the system (13.81) that is a consequence of the discussion leading 
to (12.41) . one finds that whenever the system of equations 

^(G(x, - a) - G(y, - a)) = (3.9) 

i=i 

is satisfied, then so too is the system (13.81) . If we substitute %i e = yi e = a (1 ^ 
i ^ s), then we find that G(x,; — a) = H(xj), where H is the aforementioned 
orthogonal projection of G determined by (1, 2, . . . , e — 1). Write H' for any 
reduced system equivalent to H. Then we conclude that whenever z, w is a 
solution of the system 

s 

£(H'( Zi ) - H'( Wi )) = 0, 
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then the system (13. 9p has the solution 

x» = (z i; a) and y* = (w i( a) (1 < z < s). 
The latter is also a solution of (13.81) . and hence 

J S (X;G)^ J2 Js(X;H')= J2 Js(X;H) = [X}J s (X;Il). 

This confirms the lower bound (13.71) . and in view of our earlier discussion the 
proof of the theorem is complete. □ 

We turn now to discuss lower bounds for J S (X; F) for the most basic exam- 
ples of reduced translation-dilation invariant systems F. 

(a) The classical system of Vinogradov. We recall the system 

F = (z k ,z k -\...,z) 

of dimension 1, rank k, degree k and weight \k[k + 1). In this situation, the 
conclusion of Theorem 13.11 delivers the familiar lower bound 

J S {X; F) > X s + x 2s ~^ k+l \ 

(b) The system of Par sell. We next return to the situation with d ^ 2 and the 
system F d = (z^z^ ■ ■ ■ z d d : 1 ^ |i| ^ k). On writing 

= + s ,~ l )< < 3 - 10 > 

;=i ^ ' 

we see that this system has weight K d , and it follows from Theorem 13. II that 

d-l 

J S {X; F d ) > X sd + X 2sd - Kd + ^X d - s J s (X- Fa) 

<5=1 

d 

> x sd + J2x d ~ 5 (x 2sS ~ Ks + X s5 ). 

5=1 

We therefore conclude that 

d 

J S (X; F d ) > X sd + X {2s - 1)j+d ~ K \ 

3=1 

and this establishes Theorem 11.21 

Let us consider the strength of the upper bound presented in Theorem 11.11 
in the light of the lower bound just established. Define r and K as in (12. 6p 
and (12.71) . When s is large enough, we expect that J S (X; F d ) <C x 2sd ~ K , and 
indeed this estimate is a consequence of Theorem 11.51 when s ^ r(k + 1) + 1. 
We show here that this lower bound on s cannot be relaxed substantially when 
k is large in terms of d. 
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Theorem 3.2. Suppose that s, k and d are natural numbers, and that v is a 
real number with 2d/k < v < 1. Then there is a positive number r\ = r)(d, k) 
such that, whenever 

8^^^{l-u)r(k + l), (3.11) 
one has J S , M (X) > X 2sd - K+T] . 

Proof. Recall the notation introduced in f 1 3 . X [) . and consider a natural number 
s satisfying (13. lip . We observe that as a consequence of Theorem 11.21 one has 

X 2sd-K d ^ X 2sd - K J ~ ' 1 ' 

From equations (ll.2p and ( II ,3p . one has 

dk (k + d\ d . „ . , 

K d = - , = (r + l)k 

d+l\ d J d+V ' 

and 

(d-l)k fk + d- 1\ d-1. 1 . 1 

Consequently, one finds that 

K d - K d _i + 1 - 20 ^—(r + l)k - ^Z±(r + l)k + l 

d + 1 k + d 

<l -(1 - v)r{k + 1) 



d + 1 

1)i ( 1 _^_ (1 _„ )(1+1/fc) ). 

Then provided that v > 2d/k, one may infer that 

d , . /2c? — 1 d 1 \ 

- + 1 - 2* > j^fr + D*(— " — d + jj^) > o. 

One concludes therefore that there exists a positive number r) such that 

J., k , d (X) » x 2sd -^+". 

This completes the proof of the theorem. □ 

Recall that Theorem 11.11 asserts that J s ,k,d(X) <C x 2sd ~ Kd+£ whenever s ^ 
r(fc+l). Consequently, for any positive number v < 1, it follows from Theorem 
13.21 that when is large enough in terms of v and d, such a conclusion is 
impossible if the constraint on s is replaced by 

s > — - — (1 - i/)r(Jfe+l). 
2d + 2 v ; v ; 

Thus, the conclusion of Theorem 11.11 is at worst a factor of essentially 2 + 2/d 

away from the best possible conclusion of its type. 

When d is large and k is small the situation changes. Here, whenever 
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we find that 

K < ~ K ^ + 1 ~ 2s> ih ir + H 1 - -kTi) >0 - 

When d is large enough in terms of k, one finds that 

WTT) (itt) < r + = « k + « (2CET155) (1 + ° (fc/d)) ' 

Thus, when z/ > is small enough in terms of d and k, the lower bound 
Js,k,d{X) 3> X 2sd ~- ft:d+7? holds for a positive number 77 provided that 



In this situation, the conclusion of Theorem II. H is at worst a factor of essentially 
2(1 + \/k)d/k away from the best possible conclusion of its type. 

(c) The system of Arkhipov, Karatsuba and Chubarikov. Here we consider the 
situation with d ^ 2 and I ^ 1 in which F is given by (12. 8p . On recalling 
( 12.1 Op . the weight of such a system with dimension d is K d = ^dl(l + l) d , 
whilst the corresponding weight of such a system with dimension cZ — 1 is 
Kd-i = \{d— As in the argument leading to (I3.12p . an application 

of Theorem 13.11 in this instance delivers the lower bound 

X2sd-K d ^ ^ 

From (12.101) one finds when 

s< \dl(l + l) d (l- (I -l/d)(l + iy l ) 

one has 

K d - K d _ x + 1 - 20 1 + \dl{l + l) d (l - (1 - l/d)(l + I)' 1 ) - 2s > 0. 
Observe that 

\di{i + - (1 - lm + 1)- 1 ) = l \° { l'^\ di + m + i) d - 1)- 

Thus, when v > is small enough in terms of <i and I, one finds that the lower 
bound J S (X; F) ^> x 2sd ~ K+TJ holds for a positive number 7] provided that 

In this situation, the conclusion of Corollary 12.21 is at worst a factor of essen- 
tially 4(1 + 1 /I) away from the best possible conclusion of its type. By way of 
comparison, the work of Arkhipov, Karatsuba and Chubarikov j2] would miss 
the best possible conclusion by a factor of order oHogZ. 
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4. Preliminary manoeuvres 

Our purpose in this section is to describe further notation and establish such 
preliminary estimates as are required to initiate the efficient congruencing pro- 
cedure, with the ultimate objective of proving Theorem 12.11 With this aim in 
mind, let F be a reduced translation-dilation invariant system of polynomials 
having dimension d, rank r, degree k and weight K. Since the conclusion of 
Theorem 12.11 follows from linear algebra when k — 1, there is no loss of general- 
ity in supposing that k ^ 2. We consider the system F to be fixed throughout, 
and consequently suppress mention of F by abbreviating J S (X;F) to J S (X), 
with similar conventions in other notation as appropriate. We recall the nota- 
tion introduced in (I3.2p and (13. 3p . and note the consequence of orthogonality 
recorded in (13. 4p . 

Our argument involves the investigation of systems of congruences, the sin- 
gular solutions of which must be isolated for special treatment. We pause 
at this point to introduce a special Jacobian determinant that facilitates this 
treatment. First, given a polynomial G(z) G Z[zi, . . . , Zj], we write <9jC7 to 
denote the partial derivative of G with respect to the ith variable, so that 

diG(z) = -7^-( z i> ■ ■ -,z d ). 

We now consider a function a : {1, . . . , r} — > {1, . . . , d}, and define the associ- 
ated Jacobian determinant A r (x; a) = A(x 1; . . . , x r ; cr) by 

A(xi, . . . ,x r ; a) = det (d^F^x;))^,. ^ . (4.1) 

We seek a choice for a having the property that A r (x; a) is not identically 
zero as a polynomial in x. With this goal in mind, we consider the monomials 
occurring in F and A r (x;<r), and introduce an ordering on the exponents 
associated with these monomials in order to ease discussion. When t is a 
natural number, and a, b G (NU {0})*, we say that a is less than b in colex 
order when there exists an index i with 1 ^ % ^ t such that cij < 6«, and 
further aj = bj for j > i. In such a situation, we write a -< b, and we 
write a ^ b when a = b or a -< b. Next, given a G (N U {0})', we write 
x a for the monomial The monomials x a may now be ordered 

according to the colexicographical order of their indices. Notice that when Xj = 
(pen, . . . , Xid), then the monomial x^ 1 . . . x^ r has smaller degree than x^ 1 . . . x^ r 
in colex if and only if there exists an index i for which Cj -< d i; and further 
Cj = dj for j > i. 

Lemma 4.1. There exists a function a : {1, . . . , r} — > {1, . . . , d} such that 
A r (x; cr) is non-zero as a polynomial in Xi, . . . , x r . 

Proof. We begin by interpreting the polynomials Fj(z) in terms of the colex 
ordering of monomials. Recall that k = maxdeg(-F } ). Write 



= {a G (N U {0}) d :l^a 1 + ... + a d ^k}, 
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so that the polynomials Fj(z) are necessarily linear combinations of the mono- 
mials z a with a G srf . We put A = card(=e/), and label indices in such a 
manner that 

st = {ai, . . .,& A }, 

with a x -< a 2 -<...-< a^ the colex ordering of the elements of srf ' . Thus, for 
1 ^ j ^ r, there exists an integral A-tuple Cj, which we consider as a row 
vector, having the property that 

Fj(z) = cj ■ (z a \...,z aA ) T . 

Since F 1 , . . . , F r are linearly independent, the matrix 

^ = ( C J')l<j<r 

must have full rank, and hence there exists an invertible r x r matrix ^ with 
rational coefficients having the property that the product & = is a full 
rank matrix in inverted reduced row-echelon form. By the latter we mean that 
if & is the matrix 



(Pl,m) l«r , 



then the corresponding matrix 



{Pr+l-l,A+l-m) l^Kr 

is in conventional reduced row echelon form. We define the r-tuple of polyno- 
mials (Gi, . . . , G r ) by putting 

(d, . . . , G r ) T = JZ(F U . . . , F r ) T = JZ^(z a \ z aA ) T 

= M{z ai ,...,z aA ) T . (4.2) 

Let z hj denote the leading monomial of Gj(z) in colex, for 1 ^ j ^ r. Since 
M is in inverted reduced row echelon form, we have 

bx -< b 2 -< . . . -< b r . (4.3) 

We now define the function a : {1, . . . , r} — > {1, . . . , d} as follows. When 
1 ^ i $C r, we take a(i) to be the smallest index h with the property that 
b ih > 0. 

It remains to verify that with the choice of the function a just made, the 
polynomial A r (x; a) is non-zero. Let ej denote the d- dimensional vector whose 
ith coordinate is equal to 1, all of whose remaining coordinates are 0. When 
1 ^ n ^ r, define the Jacobian determinant D n (x; a) = -D(xi, . . . , x n ; a) by 

.D(xi, . . . , x n ; a) = det(9 CT ( i )Gj(x i ))i <i)J - <n . 

We proceed by induction to show that for 1 ^ n ^ r, the determinant -D n (x; a) 
may be expanded in the shape 

D n (^a) = D^(^a) + D^(^a), (4.4) 

where 

n 

^wnn^o^r^ ( 4 - 5 ) 



0%j 
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and Di 2) (x ;er) is of smaller degree in colex than (x; a) . Notice that the 
definition of cr implies that &i )0 -m • • • b n ,a(n) ^ 0. Then having established the 
inductive hypothesis for n — r, it follows that -D r (x; cr) contains the monomial 

i=i 

with a non-zero coefficient, and hence -D r (x; cr) must be a non-zero polynomial. 
In this way, the proof of the inductive hypothesis will facilitate the proof of 
the lemma. 

When n — 1, we have -D n (x; a) = <9 CT (i)Gi(xi), and hence it follows at once 
that Q£ID holds with 

D\ ; (x;cr) = 6i, ff( i)X 1 

for some polynomial .D j 2 ^ (x; cr) of degree smaller in colex than x^ 1 e<T(1) . Thus 
the inductive hypothesis holds with n = 1. 

Suppose next that the inductive hypothesis has been established already for 
1 ^ n < u. When 1 ^ i,j ^ u, define the polynomials 0y by putting 

UjO -(u)X« when % = j = u, 

0, when ^ (u,u). 

Then the determinant D u (x.; a) has the expansion 

-D u (x; cr) = 8 uu D u -i(x; cr) + det(9 (jW G i (x i ) - %)i<i,j^ u . 

On making use of the inductive hypothesis with n = w — 1 in order to expand 
/} u _i(x; cr), we deduce that 

L> u (x; cr) = 0tM»(-Dt£i(x; cr) + I>£i(x; cr)) + det(9 (7 (i)G j (x i ) - 0y)i<Hi<« 

= D«(x;<r)+Df (x;<r), (4.6) 

where 

Di°)(x; cr) = 6 Mj(T(n) x«"~ eCT( " ) D2 1 (x; cr) + det(^ w G,(x,) - ^■) 1 ^ jKu . (4.7) 

Every term on the right hand side of ( 14. 7p may be expanded as a linear com- 
bination of terms of the shape 

x^ eCT(1) ...x!r eCT(u) , (4.8) 

in which for some permutation 7r : {1, ...,«} — > {1, ...,«}, there exist d- 
tuples b' 1; . . . , h' u with (b' 1; . . . , b' u ) =<! (b 1; . . . , b u ) and h\ ^ hi (1 ^ / ^ u), 
and having the property that 

(hi,...,h u ) = (b' n{1) ,...,b' n{u) ). 

Moreover, in the event that tt is the identity permutation, then one has 
(b' 1 ,...XH(bi ) ---,b u ). 

If ii is the identity permutation, then 

(hi,...,h u ) = (b[, . . . ,h'J -< (b 1; ...,b u ). 
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Since the (strict) colex ordering between tuples is not reversed on component- 
wise addition, a modicum of thought confirms that 

(hi - e CT (i), . . . , h u - e a (u)) -< (bi - e^i), . . . , b u - e CT ( u) ). (4.9) 

Unfortunately, our notation is somewhat opaque, and so it may be worth- 
while to spell out the details underlying this deduction. Since (h 1; . . . , h u ) -< 
(bi, . . . , b u ), there exists an index I with the property that h/ -< b/, and fur- 
ther hj = hj for j > I. But then there exists an index m = m(l) with the 
property that h[ m < bi m , and further hy = by for j > m. On recalling that 
the definition of the function a implies that by = for j < a (I), we deduce 
that m(l) ^ cr(0- Thus we find that h/ — e a n) -< hi — e CT (/), and further that 
hj — e CT (j) = hj — e CT (j) for j > I. We therefore conclude that (14. 9 j) holds, as we 
had previously asserted. 

Suppose next that 7r is not the identity permutation, and let % be maximal 
with 7r(z) 7^ i. Then for j > i one has j = ir(j), and hence 

Since j = ir(j) for j > i, it follows that i > 7r(i), and hence from (14. 3p we see 
that 

= b^j) ^ b,r(j) -< hi. 

It follows that (hi, . . . , h u ) -< (bi, . . . , h u ), so that as above one finds that the 
relation ( 14. 9 p holds also in the situation that tt is not the identity permutation. 
In view of gSD, it follows that D { u\ x; a) is of smaller degree in colex than 
the polynomial Du (x; a) defined by means of (14.51) . We therefore deduce 
from ( I4.6P that the relation (I4.4p holds with n = u, and with Du (x; a) of 
smaller degree in colex than D« (x; a). We have consequently established the 
inductive hypothesis with n = u, and hence the inductive hypothesis holds for 
1 ^ n ^ r. In particular, the determinant -D r (x; a) is a non-zero polynomial. 

We now reverse course in order to relate the non- vanishing of Z) r (x;cr) to 
the non- vanishing of A r (x; a). For each I with 1 ^ I ^ d, it follows from (14. 2 p 
that one has the identity 

(did, • • • , dtGrf = ^(m, . . . , diF r ) T , 

whence 

Consequently, one has 

D r (x.; a) = (det^#)A r (x; a). 

But the matrix is invertible, so that det ^# ^ 0. Our conclusion that 
-D r (x; a) is a non-zero polynomial therefore forces us to conclude that A r (x; a) 
is also a non-zero polynomial. This completes the proof of the lemma. □ 

Henceforth, we fix our choice of the function o : {1, . . . , r} — > {1, . . . , d} 
so that A r (x;cr) is a non- zero polynomial, as permitted by the conclusion of 
Lemma [4.1[ and we write A r (x) for A r (x; a). 
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We next turn to the task of introducing notation with which to describe 
the mean values central to our methods, as well as exponents with which to 
bound these mean values. We refer to the exponent X s = A S (F) as permissible 
when, for each positive number e, and for any real number X sufficiently large 
in terms of s, F and e, one has J S (X;F) <C X Xs+£ . Define A* to be the 
infimum of the set of exponents A s permissible for s and F, and then put 
r] s = A* — 2sd + K. Thus, whenever X is sufficiently large in terms of s, F and 
e, one has 

J S {X) < X x ° +£ , (4.10) 

where 

A* = 2sd - K + r] s . (4.11) 

Note that, in view of the lower bound supplied by Theorem 13.11 and the trivial 
estimate J S (X) ^ X 2sd , one has ^ T] s ^ K for s G N. 

We take 5 to be a small positive number to be fixed shortly. Let u be a 
natural number with u ^ k, put Sq = ur, and fix a natural number s with 
s ^ s . Our goal is to show that A* +r = 2(s + r)d — K, whence i] s+r = 0. 
From this and the definition of A* +r , it follows that there exists a sequence of 
natural numbers (X n )^ =1 , tending to infinity, with the property that 

J s+r (X n ) > Xn s+r ~ S {n e N). (4.12) 

In view of f )4.10p . when X n is sufficiently large and X^f < Y ^ X n , we also 
have the corresponding upper bound 

J s+r (Y) < Y x ^ +S . (4.13) 

Notice that since s ^ so, the trivial inequality \f(oc;X)\ ^ X d leads to the 
upper bound 

J s+r {X) <: X 2 ^ s - S ^ d j> \f(a;X)\ 2so+2r da = X 2 ^- S0 ^ d J S0+r (X). 

It follows that one has r] s+r ^ r] So+r , and so we are free to restrict attention to 
the special case s = sq. Finally, we take N to be a natural number sufficiently 
large in terms of s and F. We then put 

9 = N-V 2 (r/s) N+2 , (4.14) 

and fix S to be a positive number with S < (Ns)~ 3N , so that 5 is small compared 
to 9. We now consider a fixed element X = X n of the sequence (X n ), which 
we may assume to be sufficiently large in terms of s, F, N and 5, and we put 
M = X e . Thus, in particular, one has X s < M 1 ^ . 

Let p be a fixed prime number with M < p ^ 2M to be chosen in due course. 
That such a prime exists is a consequence of the Prime Number Theorem. 
Recall the notation introduced in (13.21) . When c is a non- negative integer, and 
£ G Z d and ct G [0, l) r , define 

f c (a;£)= e W x ;«))- ( 4 - 15 ) 

x=£ (mod p c ) 
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We need to equip ourselves with exponential sums of a type related to that de- 
fined in f)4.15p . but possessing inherently non-singular structure. We introduce 
the notion of well-conditioned r-tuples (x 1; . . . , x r ) with x, G Z d (1 ^ j ^ r). 
We denote by H c (£) the set of r-tuples £ = . . . , £ r ) G (Z d ) r , with 

1 < & < P c+1 and = $, (mod p c ) (1 < i < r), 

and satisfying the property that A(£ l5 . . . , £ r ) ^ (mod p( K ~ r ) c+1 ). In addi- 
tion, write S r = {1, — l} r , and consider an element er of S r . Recalling the 
definition (I4.15p . we then put 

r 

?es c (4) *=i 

Next we introduce the two mean values that underly our arguments. When 
a and b are non- negative integers, and cr, r G E r , we define 

J a CT 6 (X; Z,ri) = f |^(a; £) 2 f 6 (a; r?) 2 1 da, (4.17) 

and 

jsyt*; e i,) = j 2 ^"(«; ^) 2 1 da. (4.is) 

We then define 

I a b(X) = max max max I^JX; 77), (4.19) 

and 

AT a6 (X) = max max max K° b T (X; £, 77). (4.20) 

l^^s£p a l^TJ^pi) <T,TSS r 

We stress that, although these mean values depend on our choice of p, this 
dependence will shortly be rendered irrelevant when we fix our choice of p 
once and for all. Consequently, we suppress mention of p in our notation. 

Finally, following the simplifying notational device of [211 §3] , we define the 
normalised magnitude of the mean values J s+r (X) and K a ^{X) as follows. We 
define [[J s+r (X)]] by means of the relation 

J s+r (X) = X 2 ^ d - K [[J s+r (X)}], (4.21) 

and when ^ a < b, we define [[ila^X)]] by means of the relation 

K a , b {X) = (X/M a ) 2rd - K (X/M b ) 2sd [[K a , b (X)}}. (4.22) 

Note that in view of (14.111) . the lower bound (I4.12p implies that 

l[J s+r (X)]} > X^- 5 , (4.23) 

while the upper bound (14.131) ensures that whenever X s2 < Y ^ X, then 

[[J s+ r(Y)]\ < Y^ +S . (4.24) 
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5. Auxiliary mean values 

We collect together in this section several mean value estimates that facili- 
tate our subsequent analysis. We begin by exploiting the translation-dilation 
invariance of the system F so as to bound an analogue of J S (X) in which 
variables are restricted to an arithmetic progression. This argument will be 
familiar to those who work on such problems. 

Lemma 5.1. Suppose that c is a non-negative integer with c9 ^ 1. Then for 
each natural number t, one has 

max /|f c (c*;£)| 2 'da< t J t (X/M c ). (5.1) 

Proof. Let £ be an integral ci-tuple with 1 ^ £ ^ p c . By orthogonality, it 
follows from (I4.15P that the integral on the left hand side of (15.11) is bounded 
above by the number of integral solutions of the system 

t 

E( F 0° c y* + *) - F 0° Cz * + *)) = °> ( 5 - 2 ) 

with ^ y, z ^ X/p c . The translation-dilation invariance of the system 
F discussed in the context of (12.11) and (12.41) shows that y, z is an integral 
solution of (I5.2p if and only if 

J>(y,) - F(z ?; )) = 0. (5.3) 

i=l 

But on recalling (13.31) and employing orthogonality again, we perceive that the 
number of integral solutions of (15.31) with ^ y, z ^ X/p c is equal to 

j |1 + /(a; X/p c )\ 2t da « t 1 + j |/(a; X/p c )\ 2t da. 

Thus we obtain the upper bound 

^|f c (a;£)| 2t da<l + J t (X/p c ). 

Since the condition c8 ^ 1 ensures that X/M c ^ 1, a consideration of diagonal 
solutions ensures that Jt(X/M c ) ^ 1, and the conclusion of the lemma follows 
on noting that J t (X/p c ) «C J t (X/M c ). □ 

Singular solutions associated with the vanishing of A r (x) are difficult to 
control, and so we prepare a lemma to bound their number. We first introduce 
some additional notation. Let F be a field. We restrict attention to either 
the field of rational numbers Q, or else the finite field of p elements ¥ p . The 
coefficients of A r (x) embed into both of these fields. Suppose that s$ C F 
is finite, and write A = card(^Z). Let t be a natural number. We denote by 
y t (^ ; F) the set of t-tuples (xi, . . . , x t ) G (=c/ d )' having the property that the 
determinants A r (x J1 , . . . , x Jr ) vanish for all r-tuples (ji, . . . ,j r ) with 1 ^ j ^ t. 
Before announcing our main estimate for card(J?^(.2/; F)), we first recall a 
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familiar lemma bounding the number of zeros of polynomials in many variables 
(see [HI Lemma 2], for example). 

Lemma 5.2. Let T G F[yi, . . . , y u ] be a non-trivial polynomial of total degree 
k. Then the number of solutions of the equation T . . . , y u ) = with y G stf u 
is at most kA u_1 . 

Proof. We proceed inductively. When u — 1, the desired conclusion is a con- 
sequence of Lagrange's theorem. Suppose then that the conclusion of the 
lemma has been established for 1 ^ u < v, and let ^ G ¥[yi, . . . ,y v ] be 
a non-trivial polynomial of total degree k. By relabelling variables if nec- 
essary, we may suppose that \I/ is explicit in y v . Let the degree of \I> with 
respect to y v be u, and let the coefficient of y^ be $(yi, . . . , y v -i)- Then $ 
is a non-trivial polynomial in v — 1 variables of degree at most k — u>. By 
the inductive hypothesis, the number of solutions of . . . , = with 
(yi, . . . ,y v -i) G .s^ -1 is at most (k — w)^ -2 . Then the number of ^-tuples 
(yi, . . . , y v ) G .c/ 1 ' satisfying $(yi, . . . , = is at most (« — a;)A u_1 . Mean- 
while, when $(yi, y v -i) 7^ and ty(y%, . . . ,y v ) = 0, then y„ satisfies a 
non-trivial polynomial of degree u determined by y%, . . . ,y v -i- So there are 
at most wA"" 1 solutions of \&(y) = with y G srf v and $>(yi, . . . , y v -i) 7^ 0. 
We conclude that the total number of solutions of \I/(y) = with y G stf v is 
at most (k — cj)v4 , '~ 1 + uA 1 '^ 1 = kA v ~ 1 , and hence the inductive hypothesis 
follows for u — v. The desired conclusion therefore follows by induction. □ 

Lemma 5.3. Suppose that A r (x) is not identically zero as a polynomial in F. 
Then 

card(^(^;F)) < ^4*(<*-i)+r-i_ 

Proof. The conclusion of the lemma is trivial when t < r, so we may as- 
sume that t ^ r. We define a sequence of non-zero polynomials £^(x) = 
^i(xi, . . . , Xj) (0 ^ i ^ r) as follows. We begin by setting fS^(x) = A r (x). 
Suppose then that for some I ^ 1 we have constructed the polynomials S^(x) 
for / ^ i ^ r. Amongst the monomials x hl . . . xj 1 ' occurring in f^(x), let 
denote the largest of the <i-tuples h; in colex. It follows that there exist 
polynomials ^_i(x) and ^(x) having the property that 

^(xi,...,x,) = ^_i(x 1) ...,Xi_ 1 )x| 3l +^(x 1 ,...,x z ). (5.4) 

We may suppose that £^_i(x) is non-zero, and that every monomial x hl . . . xj 1 ' 
occurring in ^(x) satisfies h; -< b/. In this way, we have defined polynomials 
^i(x) for ^ 2 ^ r. Notice here that ^o(x) is a non-zero element of F. 

Consider an integer j with 1 ^ j ^ r, and denote by 38 j the set of all 
j-element subsets of {1, 2, . . . , t}. We define ^ to be the set of t-tuples 
(xi, . . . , x t ) G (£/ d y satisfying the property that (a) for each subset {h, . . . ,lj} 
in 38 one has ^-(x^, . . . ,xl) = 0, and (b) whenever i < j, there exists a sub- 
set {mi, . . . , rrii} in 38 i such that ^(x mi , . . . , x m J ^ 0. A moment of reflection 
reveals that y t {^ ; F) is the union of . . . , 3T r . 

We next seek to bound card(^) for each integer j with 1 ^ j ^ r. Consider 
a t-tuple (xi,...,x t ) G There exists a subset {mi,...,m 3 -_i} in 
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with ^_i(x mi , . . . , x mj . J 7^ having the property that for each index m with 
1 ^ m ^ t for which m ^ {mi, . . . , mj-i}, one has i^-(x mi , . . . , x mj ^ , x m ) = 0. 
In view of the relation (5.4), the latter equation implies that the <i-tuple x m 
satisfies 

^3'-l(^mi > • • • > ■X-mj-i "I - (^mi j • • • j -K-r%-_i j X m ) 0, (5-5) 

in which the second term on the left hand side is of smaller degree in x m in 
colex than the first term. Notice that since ^_i(x mi , . . . , x m ) ^ 0, then in 
particular the equation (15. 5p is non-trivial as a polynomial equation in x m . In 
this way, we deduce from Lemma I5T21 that for each index m with 1 ^ m ^ t for 
which m ^ {m 1; . . . ,rrij_i}, the number of <i-tuples x m G srf d satisfying (15. 5p 
is at most 

(b jl + ... + b jd )A d - 1 ^ (k-l)A d -\ 

The total number of choices of x m e for all such indices m is therefore 
at most ((k — l)^ -1 )* - ^' -1 ). The number of choices for (x mi , . . . , x mj _ 1 ) E 
{s/ d y~ l , meanwhile, is at most A d ^ _1 \ Since a trivial estimate confirms that 
card(^_i) ^ t^ -1 , we deduce that 

card(^) *C V-\(k - l)^- 1 )*-^ 1 ^) 3 '- 1 < 

Combining the contributions of . . . , 3T r , therefore, we conclude that 

r r 

card(^K;F)) = ^card(^) < ^ (d_1)+J ' _1 < A*^ 1 ^" 1 . 

This completes the proof of the lemma. □ 

We are now equipped to initiate the iterative procedure. It is at this point 
that we fix our choice for the prime number p. 

Lemma 5.4. There exists a prime number p with M < p ^ 2M for which 
J s+r (X)«M 2 ^J 0il (X). 

Proof. The mean value J s+r (X) counts the number of integral solutions of the 

system 

2(s+r) 

(-l) l F(x ?; ) = 0, (5.6) 

i=l 

with 1 ^ x ^ X. Let To denote the number of such solutions in which 

A(x il ,...,x ir ) = (5.7) 

for all indices i\ (1 ^ I ^ r) satisfying 

l^i^2(s + r). (5.8) 

Also, let T\ denote the corresponding number of solutions in which (15. 7p fails 
to hold for some index i satisfying (15. 8p . Then J s+r (X) = T + T\. 

We first consider T . Put ^ = {1, 2, . . . , [X]} and 5? = ^ 2s+2 r(^;Q)- 
Then on recalling the discussion in the preamble to the statement of Lemma 
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15. 2\ we see that whenever x is counted by To, one has x G S fi . It therefore 
follows from Lemma [5.31 that 

T < card(^ 2s+2r K; Q)) « X 2( s +r)( d -i)+r-i_ 

Note that our hypotheses ensure that s #J rk. Then since 

r 

K = ^ kj < rife, 

we find that s ^ K. In view of the lower bound on J s+r (X) available from 
Theorem 13.11 we deduce that 

T < x 2 (' +r > d - 2 *- 1 < x 2 ^^- 1 < X' 1 J s+r (X). (5.9) 

We next turn to the solutions counted by T\. We begin by examining the 
determinant A r (z). On recalling (14. ip . we see that the (j, i)-th entry of the 
matrix associated with the determinant A r (z) has degree at most kj — 1. As 
a polynomial in z, therefore, we see that the degree of A r (z) satisfies 

r 

deg A r (z) ^ ^(fy - 1) = X - r. 

i=i 

The coefficients of the monomial entries of A r (z) depend at most on F, and 
so for sufficiently large values of X, the sum of the absolute values of these 
coefficients is bounded above by X. Thus we see that 

max |A r (z)| < X K . 

Let denote any set of [K/ 8} + l distinct prime numbers with M < p ^ 2M. 
Such a set exists by the Prime Number Theorem, since we are at liberty to 
assume X to be large enough in terms of K and 9. It follows that 

np > M K/e = X K > max |A r (z)|. 

Consequently, whenever 1 ^ z ^ X and A r (z) ^ 0, then there exists a prime 
p G 2? for which p \ A r (z). In particular, for each solution x of (15. 6p counted 
by Tl, there exists an index i satisfying (15.81) and a prime p G & for which 
A r (x il , . . . ,Xi r ) ^ (mod p). 

Let denote the set of all indices i satisfying (15.81) . and define er = <x(i) by 
putting <x(i) = ((-l)* 1 , . . . , (-l) ir )- Also > P ut 

Z(i) = (-l)«+... + (-l)v m (i) = i( r + Z(i)), n(i) = |(r-Z(i)). 

Then on recalling the definition ( I4.16P and considering the underlying Dio- 
phantine equations, we see that 

T « <fr {i \<x-,o)f( a] xy +r - m f(- a] xy +r - n d a . 

An application of Schwarz's inequality therefore reveals that 

|S r r (a;0) 2 /(a;X) 2s |da) U |/(a;X)| 2s+2r da V ~ 
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Recalling now (13.41) and (14.191) . we deduce that there exists a prime number p 
with M < p < 2M for which 

T x « (I 0ja {X)) 1 ' 2 (J a+r (X)) 1 ' 2 . 

By reference to (15.91) . we therefore arrive at the upper bound 

J s+r (X) = T + T 1 « X- 1 J s+r (X) + {I^{X)) l l 2 {J s+r {X)) l '\ 

whence 

J s+r (X) < 1 + J 0)0 (X) < 7 0j0 (X). (5.10) 

Our final step is to split the summation in the definition (I4.15P of fo(cK; 0) 
into arithmetic progressions modulo p. Thus we obtain 

fo(o;0)= J2 
and so it follows from Holder's inequality that 

In this way we deduce from (I4.17P and (14. 19j) that 

I ,o{X) < {M d ) 2s max max / 0) 2 f 1 (o:; £) 2s | da < M 2sd I 0)1 (X). 

The proof of the lemma is made complete by substituting this last estimate 
into flBTTOjl . □ 

This is the point at which we fix the prime number p, once and for all, in 
such a way that the estimate J s+r (X) <C M 2sd I ^{X) holds. That such a 
choice is possible is guaranteed by the conclusion of Lemma 15.41 

6. Auxiliary congruences 

The main thrust of our argument begins with a discussion of the congruences 
that play a critical role in what follows. We first introduce some additional 
notation. When a G S r , denote by ^^ b (m; £, 77) the set of solutions of the 
system of congruences 

r 

£ ViFjizi -rj)= m, (mod p*' 6 ) (1 < j < r), (6.1) 

i=l 

with 

1 ^ z ^ p kb , Zi = £ (mod p a ) (1 ^ i ^ r) (6.2) 

and 

A(zi, . . . , z r ) ^ (mod ( 6 3 ) 

The non- singularity condition (16.31) is awkward to handle directly, and so we 
simplify it using the condition (16 .2p by means of the following lemma. 

Lemma 6.1. One has the polynomial identity 

A(t Zl +£,...,tz r + Z) = t K ~ r A(z u . . . , z r ). 
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Proof. It suffices to establish the claimed identity in the special case t = 1, 
since the homogeneity of the polynomials Fj (x) ensures that 

r 

A(tz x , . . . , tz r ) = (J] t*'" 1 ) A(z ls . . . , z r ) = ^- r A(z l5 . . . , z r ). 

3=1 

Consider then the situation with t = 1, and recall the relation (12.31) . By the 
chain rule, we find that for 1 ^ I ^ d one has 

_ (x+ e = _ (F(X + Q) = ^ (c( e)P W ) = c«)a,F (x ). 

The definition (14. ip of A r (x) therefore delivers the relation 

A(xi + £,..., x r + £) = det(9 (7(i )F i (x i + C))ui,Kr 

= det (C(^)(9 (T(i) F j (x J ))i^ iKr ) 
= (detC(0)A(x 1; ...,x r ). 

But C(£) is lower unitriangular, so that detC(£) = 1. We therefore conclude 
that 

A(xi +£,..., Xr + €) = A(xi, . . . , x r ), 
and in view of our earlier discussion, the proof of the lemma is complete. □ 

We also require an analogue of Hensel's lemma in order to lift solutions of 
congruences to progressively higher moduli. A suitable version of this lifting 
process is implicit in the next lemma. 

Lemma 6.2. Let fx, ■ ■ ■ , ft be polynomials in Z[xi, . . . , xt] with respective de- 
grees Kx, ■ ■ ■ , Kf, and write 



J(f;x) = det(||(x) 



When w is a prime number, and I is a natural number, let jV{i\w 1 } denote 
the number of solutions of the simultaneous congruences 

fj(xx, ...,x t )=0 (mod w l ) (1 < j < t), 
with 1 ^ Xi ^ w l (1 ^ i ^ t) and (J(f; x), w) — 1. Then ^(f; w l ) ^ Kx ■ ■ ■ 
Proof. This is [T9| Theorem 1]. □ 

We are now equipped to establish the basic estimate for the number of 
solutions of the congruences (16.11) comprising ^^ b (m; £, 77). 

Lemma 6.3. Suppose that a and b are non-negative integers with b > a. Then 

max max max card(^ 6 (m; £, 77)) Oi ■ ■ ■ k r p^ b ~ a)rd ~ K ^- a) . 
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Proof. Consider fixed integers a and b with ^ a < b, a fixed r-tuple cr G S r , 
and fixed integral <i-tuples £ and r\ with 1 ^ £ ^ p a and 1 ^ 77 ^ p b . Denote 
by ^i(n) the set of solutions of the system of congruences 

r 

^F(z, - rj) = n (mod p fc6 ), (6.4) 

with 1 ^ z ^ p kb and 

z.j = (mod p a+1 ) for some £ G H a (£) (1 ^ i ^ r). (6.5) 

Given a fixed integral r-tuple m, the number of r-tuples n with 1 ^ n ^ p kb , 
for which rij = (mod p k i b ) (1 ^ j ^ r), is equal to 

r 

JJ p (*-*i)& = (j)byk-K_ 
0=1 

Then it follows from (16.1 p that 

card(^ 6 (m; £,»;)) = ^ ... ^ card(^(n)) 

niEmi (mod p fe i 6 ) n r =m r (mod p fer! ') 

(p 6 ) 7 *"* max card(^i(n)). (6.6) 

We next rewrite each variable Zj in the shape Zj = p a yj + Notice that 
the hypothesis that z, = £j (mod p a+1 ) for some £ G S (£), recorded in (16.51) . 
implies that £ { = p a Vj + £ for some integral ci-tuple v,, and further that 

A(£ 1 ,...,£ r )^0(mod p^ a+1 ). 



But as a consequence of Lemma 16.11 one then has 

p (*-r)a A(vij ^ Vr) = A( ^ i) (mod p^ a+1 ), 

whence 

A(vi, . . . ,v r ) (mod p). 
However, for 1 ^ z ^ r one has 

p a v< + £ = & = z, = p a yi + £ (mod p a+1 ), 

so that y» = Vj (mod p), and hence 

A(yi, . . . , y r ) = A(vi, . . . , v r ) ^ (mod p). 

With the substitution z, = p a yj + £ in (16.41) . therefore, we deduce that the set 
of solutions ^i(n) is in bijective correspondence with the set of solutions of 
the system of congruences 

r 

^F(p a y t + £ - rj) = n (mod p kb ), (6.7) 



8=1 



with 1 ^ y ^ p a and A r (y) ^ (mod p). 
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Let y = w be any solution of the system (16. 7p . if indeed such a solution 
exists. Then it follows that all other solutions y satisfy the system of congru- 
ences 

r r 

*iF(P a yi + ^-ri) = J2 ^ F (P a ^ + €-^7) (mod p kb ). (6.8) 
i=i i=i 

The translation invariance formula (12. 3p implies that 

F(x)=C(O- 1 (F(x + |)-c (O), 

in which the matrix C(£) has determinant 1. It follows that the system of 
congruences (I6.8P is equivalent to the new system 

r r 

5>iF(p a yO = J^a-iF^Wi) (mod p kb ). 

i=\ i=l 

By homogeneity, moreover, this new system is in turn equivalent to 

r r 

]T o^-fa) = ]T ^^-(Wi) (mod p fcfe - fc ^) (1 < j < r). 

i=l i=l 

Next, we write 5^(u) for the set of solutions of the system of congruences 

r 

(TiFjiyi) = Uj (mod p kb - k n (l^j^r), 

i=l 

with 1 ^ y ^ p kb - a a nd A, r (y) ^ (mod p). Then it follows from our discus- 
sion thus far that 

card(^i(n)) < max card(^ 2 (u)). (6.9) 
Denote by 5^(v) the set of solutions of the system of congruences 

r 

J2^F( yi )=Y (modp kb ~ a ), (6.10) 
i=i 

with 1 ^ y ^ p kb - a a nd A r (y) ^ (mod p). Then we have 

card(^ 2 (u)) < ... card(^ 3 (v)) 

uisui (mod p fci> - fe i a ) v r =u r (mod pkb-k r a^ 

^ (p a ) K ~ r max card(^ 3 (v)). 

l^v^p kb - a 

Combining this estimate with (16. 9p and ( 16. 6p . we derive the upper bound 

card(^ 6 (m;£,T/)) < (p b ) rk ~ K (p a ) R - r max card(^ 3 (v)). (6.11) 



kb—a 



It is at this point that we prepare to apply Lemma 16.21 For 1 ^ % ^ r, we 
consider a fixed choice for the d — 1 coordinates yij with j ^ cr(i). We then 
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define the polynomials 

r 

/j(yv(i), • • • >2/r,«7(r)) = 53 ^-(yt) - vj (i < j < r). 

1=1 



Consider the solutions y of the system (16.101) lying in ^(v). For 1 ^ i ^ r, 
there are at most p fcb_a possible choices for each coordinate yij with j ^ o~(i). 
Write y = (yx^ru, . . . , y r ,a-(r))- Then in the notation of the statement of Lemma 
one has 

J(f;y) = det(d a ^F j (y i )) 1<id<r = A r (y) ^ (mod p). 



Thus we may apply Lemma [6.21 to show that the number of solutions 

2/1,(7(1)) • • • ,Vr,a(r) 

of the system of congruences 

/j (2/i,cr(i)5 • • • ,yr,*(r)) = (mod p kb - a ) (1 < j < r), 

with 1 ^ J/i,cr(j) ^ p fc6 - a (1 ^ ? ^ r), is at most k\...k r . In this way, we 
conclude that 

card(^ 3 (v)) < ^ ... ^ fc a . . . k r 

= fc 1 ...Av(p fc6 ~°) r(<i ~ 1) - 
Substituting this estimate into (16. lip , we arrive at the upper bound 

card(^(m; £, r,)) < A* . . . V^T^, 
and the conclusion of the lemma follows at once. □ 



7. The conditioning process 

Our next step involves extracting from the mean value I^ b (X; £, 77) associ- 
ated mean values conditioned so as to avoid singular solutions of an underlying 
system of congruences. Although motivated by the corresponding treatment 
for the classical Vinogradov system in [2TJ §5], the less digestible singularity 
condition of the present work demands some modification. 

Lemma 7.1. Let a and b be integers with b > a ^ 0. Then one has 

I a , b (X) « K a>b (X) + M^-^-H^iX). 

Proof. Consider fixed integral rf-tuples £ and 77 with 1 ^ £ ^ p a and 1 ^ r\ ^ 
p b , and an r-tuple er G S r . Then on considering the underlying Diophantine 
system, one finds from ( I4.17|) that I° b (X; 77) counts the number of integral 
solutions of the system 

r s 

*i(F(*) - F(y,)) = $>( v <) - F ( v ^))' ( 7 - X ) 

i=i 1=1 
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with 

l^x,y,v^X, vi = 77 (mod p h ) (1 < I ^ 2s), 
and satisfying the property that there exist 

(£i,...,£ r ) G 2 a (£) and (d,...,C) eS a (£) 

for which 

Xj = £• (mod p a+1 ) and = Cj (mod p a+1 ) (l^i<r). 

Let Ti denote the number of integral solutions x, y, v of the system (17. II) . 
counted by I° b (X; 77), satisfying the condition that for all r-tuples (1%, . . . , l r ) 
with 

1 < 1 < 2s, (7.2) 

one has 

A(v ill ...,v (r )E0(mod^ +1 ). 

Also, let T 2 denote the corresponding number of solutions satisfying the con- 
dition that for some r-tuple 1 satisfying ( 17. 2p . one has 

AK, . . . , v, r ) ^ (mod p^ b+1 ). (7.3) 

Then we have 

i: tb (X;tv)^T l + T 2 . (7.4) 

We consider first the solutions counted by T\. Suppose that x, y, v is a 
solution counted by T\. For each index / with 1 ^ I ^ 2s, we may rewrite 
v; in the shape vj = p b ui + 77, for some integral <i-tuple u;. Given an r-tuple 
(li, . . . , l r ) satisfying (17.21) . it follows from Lemma f67T1 that 

(p b ) K - r A(u h , u /r ) = A(p b u h + 77, . . . ,p b u lr + 77) 

= AK,...,v lr )E0(mod p^ b+1 ), 

whence 

A(u h , . . .,Ui r ) = (mod p). 

Write srf = {1,2, ... ,p} and F = Z/pZ. Then it follows that u = u (mod p) 
for some V G ^ 2s {s^ ; F). Define 3% to be the set of 2s-tuples [r\ x , . . . , r] 2s ) 
with 1 ^ rj l ^ p b+1 (1 ^ / ^ 2s) satisfying the property that (77^ . . . , 77 2s ) = 
(77 + p b v>\, . . . , 77 + p b u 2s ) for some {u\, . . . , u 2s ) G Sf 2s {srf \ F). Then we have 
(vi, • • • , v 2s ) = (77 1; . . . , 77 2s ) (mod p b+l ) for some (77^ ... , 77 2s ) G 3%. 
On considering the underlying Diophantine system, we deduce that 

Ti « j £)H W«l »7i) • • • rj 2s )\ da. 

In view of the elementary inequality 

\z± . . . z n \ ^ \z\ + . . . + \z n \ n , 
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we find from (14.1 7p that 

T i « E E f *) Vi(«; ^) 2s l da 

< card(^) max I£ b+1 (X; £, r/ ) . 



But as a consequence of Lemma 15.31 one has 

card(J^) = card(^ 2s (^;F)) < p M<*-i)+r-i 

Thus we conclude from (I4.19|) that 

Ti <<; M 2s ( d - 1 ) +r - 1 J 0i6+1 (X). (7.5) 

Next we turn our attention to the solutions x, y, v counted by T 2 . We may 
suppose that for some r-tuple 1 satisfying (I7.2p . one has the congruence (17.31) . 
We define t% for 1 ^ i ^ r by taking r» = 1 when 1 ^ ^ s, and Tj = — 1 
when s + 1 ^ /j ^ 2s. Notice that since v; = r\ (mod (1 ^ I ^ 2s), 
the condition (17. 3p implies that (vj i; . . . , Vj p ) = (i/^, . . . , i/j r ) (mod p fc+1 ) for 
some (i/jj, . . . , i/j r ) G H&(t/). Thus, on considering the underlying Diophantine 
system, we obtain the upper bound 

t 2 « £ i"|C(«;0 2 ^(«;»7)f6(«;^) 2s " r |da. 

T6Sr 

On recalling that s = w, an application of Holder's inequality reveals from 
(14.171) and (I4.18P that for some r G E r , one has 

l/(2u) 



l-l/(2u) 



T 2 «(i"|^(a;OX(a;r7) 2 1da 
x (^|^(a;£) 2 f 6 (a;r7) 2s |da 



We therefore conclude via (14.191) and (I4.20p that 

T 2 « (K a , b (X)) 1 ^ u \l a , b (X)) 1 - 1 ^. (7.6) 
On combining (17.51) and (17. 6p with (17.41) . we deduce that 

I aib (X) « M^-^-H^X) + {K atb {X)f/^\l ath {X)) l - l l^\ 
whence 

I a>b (X) « *T a , 6 (X) + M 2s{ - d -^ +r - l I M {X). 
This completes the proof of the lemma. □ 

Repeated application of Lemma 17.11 shows that whenever a, b and H are 
non- negative integers with b > a ^ 0, then 

H-l 
/i=0 

We next show that / ai 6 + //(X) is negligible for if large enough. 
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Lemma 7.2. Let a, b and H be non-negative integers with 

< b - a «C H <: -1 - b. 

Then one has 

(M H ) 2s( - d ~ r>+r ~ 1 I aib+H (X) < M' H / 2 (X/M b ) 2sd (X/M a ) 2rd - K+r ^. 

Proof. On considering the underlying system of Diophantine equations, we find 
from (I4.17P that when 1 ^ £ ^ p a and 1 ^ r] ^ p b+H , and cr 6 E n one has 

I° b+H (X; £, ij) < y |f.(a; £) 2, W(a; r/) 2s | da. 

An application of Holder's inequality in combination with Lemma I5TT1 therefore 
yields the estimate 

r/(s+r) 



s/(s+r) 



x U\h +H {ct-rj)\ 2s+2r <icx 



< ( J s+r (2X/M a ) ) r /( s+r ) ( J s+r (2X/M b+H )) s ^ s+r ^ . 
Consequently, on recalling (I4.2ip and (14. 24 p . we obtain the upper bound 

< X s (X/M a ) 2rd ~ K+11 ^ r (X/M b ) 2Ml r, (7.8) 

where 

y _ (j^ 6 ~ a+H\Ks/(s+r) j^— 2sdH 

But when H ^ b — a, one has 

#(2s(d - 1) + r - l)+(6 - a + H)Ks/{s + r) - 2sd# 

^ #(r - 1 - 2s) + 2HKs/(s + r). 

On observing that s ^ rk ^ K, and hence (s + r) 2 > Kirk + r) > iTr, one 
finds that the expression 2Hs — 2HKs/(s + r) achieves its minimum value for 
s ^ r/c when s = rk. Hence we deduce that 

H(2s(d - 1) + r - l)+(6 - a + H)Ks/(s + r) - 2sdH 

s^-H + H(r - 2rk) + 2HKrk/(rk + r) 

^-H + H(r - 2rk/(k + 1)) < -H. 

In this way, we see that for k ^ 2, one has 

whence 

The conclusion of the lemma follows on substituting this estimate into (17.81) . 

□ 

Combining Lemma I7T21 with the upper bound (17.71) . we conclude as follows. 
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Lemma 7.3. Let a and b be integers with ^ a < b, and put H = b — a. 
Suppose that b + H ^ 9~ l . Then there exists an integer h with ^ h < H 
having the property that 

I a , b (X) « {M h ) 2s ^ + ^ l K aMh {X) 

+ M~ H/2 (X/M b ) 2sd (X/M a ) 2rd ~ K+r ' a+r . 

The special case of Lemma 17.31 with a = and b = 1 yields a refinement of 
Lemma 15.41 more easily utilised in what is to come. 

Lemma 7.4. One has J s+r (X) < M 2sd K 0A (X). 

Proof. When a = and b = 1, one has b — a = 1. Thus we deduce from Lemma 
17.31 that 

Jo.iPO < K ,i(X) + M - 1/2 {X/M) 2sd X 2rd - K+ ^. 
Since we may suppose that M > X 4S , it follows from Lemma [5.41 that 
J s+r {X) < M 2sd I 0A {X) < M 2sd K 0il {X) + x 2 ^ d - K+r >^- 2S . 
But in view of QOTj and (gjZSD, one has 

J s+r {X) > X 2{s+r)d - K+ ^+ r - & , 

and thus we reach the upper bound 

J s+r (X) < M%(I) + I 4 J s+r (I). 
The conclusion of the lemma follows on disentangling this inequality. □ 

8. The efficient congruencing step 

The mean value K a ^(X) contains the powerful congruence conditions which 
drive the iterative process. In this section we convert these conditions into a 
form suited for further iteration. 

Lemma 8.1. Suppose that a and b are integers with ^ a < b ^ 6~ l . Then 
one has 

K a>b {X) « M 2 ( fcfe - a ^-^( 6 - a )(J s+r (2X/M b )) 1 - r / s (V fe (X)) r / s . 

Proof. Consider fixed r-tuples £ and r] with 1 ^ £ ^ p a and 1 ^ rj ^ p b , and 
r-tuples cr, r G E r . Then on considering the underlying Diophantine system, 
one finds that K°^{X\ 77) counts the number of integral solutions of the 

system 

r u r 

*i(F(xi) - F(y<)) = ^ J] r ™( F K«) " F ( w ^)), (8.1) 

i=l 1=1 m=l 

in which, for some r-tuples £,17 G H (£), one has 

l^x,y^X, x = £(modp a+1 ) and y = F (mod p a+1 ), 
and for 1 ^ I ^ u, for some Jt, h Q\ G one has 

1 ^ V|, Wj ^ X, V/ = 7*^ (mod p b+1 ) and W; = 0; (mod p b+1 ). 
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The translation invariance formula ( 12 .3p implies that the system ( 18. ip is equiv- 
alent to the new system of equations 

r u r 

^(F(xi - ry) - F(y 2 , -rj)) = J2Yl r -( F ( v ^ " *l) ~ F ( w *™ " l))- (8.2) 

i=l !=1 m=l 

In any solution x, y, v, w counted by K^^(X;^,rf), one has V; = = 
r] (mod p b ) (1 ^ / ^ u). We therefore deduce from (18.21) that 

r r 

ViFi(*i - V) = J2 ~ ( mod P kjb ) 0-<3<r). (8.3) 

t=l i=l 

We also have x = y = £ (mod p a ), 

A r (x) ^ (mod p(*- r )«+i) and A r (y) ^ (mod p ( K - r > +1 ). 

Recall the notation introduced prior to the statement of Lemma I6.1[ and 
write 

r 

<8S; 6 (a;&f7;m)= £ IIW 7 ^)- 

On considering the underlying Diophantine system, we deduce from (18. ip and 
(jHD that 

K${X&ri) = ■■■ W|^>;^;m) 2 SH^) 2 1 da. (8.4) 

mi=l m r =l 

By applying Cauchy's inequality in combination with the estimate supplied by 
Lemma [6 .3[ we have 

r 

|<9£ 6 (a; £, ry; m) | 2 ^ card(^ 6 (m; £, l)) ^ J] |f fc6 (a; O ? 

Ce.«- b (m;^,T,) i=l 
r 

« M ( kb - a > d ~ K ^ Yl II i^( a ; c<) I 2 - 

Substituting this relation back into (18.41) and considering the underlying Dio- 
phantine system, we find that 

«M WH /(nif fc6 (a;C)r)|^(«;r7)| 2ll d«. (8.5) 

C=4 (mod p a ) 



2r 
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Observe next that by Holder's inequality, 

C=4 (mod p a ) C=€ (mod p a ) 

^ ipkb - a)( r-i) d J- |fto(a;C)| 

C=£ (mod p a ) 

Then (18.51) delivers the upper bound 

ir a T(X;^r7)«M 2 ( fc6 -^-^- a ) max / |f w (a; C) 2 X(«;*7) 2 1 da. (8.6) 
Another application of Holder's inequality yields the bound 
/ |f»(a; C) 2r ^(«; *?H da < U^'XTj' 

where 

«/i = jw b {cx;ri)\ 2u+2 dcx 

and 

t/ 2 = /|^(«;^) 2 fo(«;C) 2 1da. 



'2 5 



On considering the underlying Diophantine system, it follows from Lemma I5TT1 
that 

U x ^j |f 6 (a; ry)| 2s+2r da « J s+r (2X/M fe ), 
whilst from (14.171) we have U 2 = Il kb {X; rj, £). Thus we conclude that 

j> \f kb (a; C) 2r r b (<*; rj) 2u \ da « ( J s+r (2X/M b )) l - r ' s {I bM {X)y/ s . 

On substituting this estimate into (18.61) . the conclusion of the lemma follows. 

□ 

We conclude this section by extracting two simplified bounds that may be 
conveniently deployed in our iteration. 

Lemma 8.2. Suppose that a and b are integers with ^ a < b ^ 6~ x . Then 

[[K a , b (X)\] < x r >^+\M h - a ) K . 

Proof. Consider fixed d-tuples £ and t? with 1 ^ £ ^ p a and 1 ^ 77 ^ p b , and 
r-tuples <r,r G S r . Considering the underlying Diophantine system and then 
applying Holder's inequality, we obtain 

K£(X&rj) < <f> |f a (a;0 2r Wa;r7) 2s |da 

(^|f (a;£)| 2s+2 Ma) r/(S+r) (y|f 6 (a;r7)| 2s+2 Ma 



s/(s+r) 
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Next applying Lemma [5. 1\ we deduce that 

K a>b (X) « (J s+r (2X/M a )Y^ s+r \j s+r (2X/M b )) s ^ s+r \ 
whence by (14.221) we arrive at the upper bound 

X s ( (X /M a ) r / (s+r ) (X /M b ) s ^ s+r ^ y( s + r ) d - K +v s +r 



[[KaAX)]] « 



(X/M b ) 2sd (X/M a ) 2rd - K 

The conclusion of the lemma follows. □ 

The basic iterative relation follows by applying Lemma 17.31 in combination 
with Lemma [8.11 

Lemma 8.3. Suppose that a and b are integers with ^ a < b ^ ^(kO)^ 1 , 
and put H = (k — 1)6. Then there exists an integer h, with ^ h < H , having 
the property that 

[[K a>b {X))) «X 5 M-( 2s - r+1 )^ s (X/M 6 )^( 1 -^)[[X b , fc6+ ,(X)]]^ 

+ M~ rH/ ^ s \X/M b ) r ' 8+r . 

Proof. We deduce from Lemma [8. II via (14.221) that 

[[K aib (X)}} < (M b ) 2sd {M a ) 2rd - K M 2ikb - a > d - K ^Tl" r/s T r 2 ls , (8.7) 

where 

J s+r (2X/M b ) I b , kb {X) 
j-i — n , , „ and io 



X2(s+r)d-K z X2{s+r)d-K ' 

But 

Ti < (M- b ) 2 ( s+r ^- K (X/M b ) r > s + r+5 . (8.8) 
Also, on putting H = {k — 1)6, we see that 

kb + H = (2k-l)b< -1 . 



Then it follows from Lemma fT73l that there exists an integer h with ^ h < H 
having the property that 

(M h ) 2s{d -^ +r ~ l K bM+h (X) M- H / 2 (X/M b ) r i°+ r 

2 ^ J^2(s+r)d~K ^ ^J^fkb^2sd^]^/[byrd-K " 

Thus we see that 

T 2 < (M- kb ) 2sd (M- b ) 2rd - K n, (8.9) 

where 

ft = M-( 2s - r+1)h [[K bM+h (X)}} + M~ H/2 (X/M b )^ +r . 
On substituting (18.81) and (18. 9p into (18.71) . we deduce that 

\[K a ^X)}} < M ^,b)^ X ^ M bYl-r/s)(r, a+r +5) Q r/s^ 

where 

u(a, b) = 2sdb + (2rd - K)a + 2(fc6 - a)rrf - K(b - a) 

- (1 - r/s)(2(s + r)d - iT)6 - (2sdA;6 + (2rd - K)b)r/s. 
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A little effort reveals that u(a, 6) = 0, and thus we obtain 

[[K a ,b(X)]\ « ( M -H/2y/s^ x i M by a+r+ 8{l-r/s) 

+ X 5 M^ 2s - r+1 ^ hr / s (X/My^ 1 - r / s \[K bM+h (X)]} r / s . 

The conclusion of the lemma follows on noting that 5 is assumed small enough 
that (X/M b ) S( - 1 - r M < M rH l^ s \ □ 

9. The iterative process 

Beginning with an application of Lemma l7.4[ which bounds J s+r (X) in terms 
of K 0j i(X), we may apply Lemma 18731 to bound J s+r (X) in terms of K a ^{X) 
for an increasing sequence of parameters a and 6. Our goal in this section is 
to manage this process, deriving useful information from the iterations. Our 
first step is to extract from Lemma 18.31 a conclusion transparent enough to be 
applied as the basic tool in each iterative step. 

Lemma 9.1. Suppose that a and b are integers with ^ a < b ^ ^(kO)^ 1 . 
Suppose in addition that there exist non-negative numbers if), c and 7, with 
c ^ 3(s/r) N , for which 

XVs+r (i+w) ^x cS M^[[K atb {X)]]. (9.1) 

Then, for some non-negative integer h with h ^ (k — 1)6, one has 

XVs+r (i+^'8) < x c ' s M-^'[[K a ,, b ,(X)]], 

where 

if)' = (s/r)if> + (s/r — 1)6, 7' = (s/r)7 + (2s — r + l)h, 
c' — (s/r)(c + 1), a' = b and b' = kb + h. 

Proof. We are at liberty to assume that c ^ 3(s/r) N and 5 < (Ns)~ 3N , so we 
have 

c5 < ls~ 2N < 9/ (3s), 

and hence X cS < M l ^ 3s \ In addition, one has M 1 ^ 3 ^ > X s . We therefore 
deduce from Lemma [8.31 that there exists an integer h with ^ h < (k — 1)6 
having the property that 

pTa,&P0]] < M- r '^ s) X^ +r 

+ x 5 (A7M b ) (Ws),?s+r (M- {2s - r+1)h [[K bM+h (X)]]) r/s . 

On making use of the hypothesised bound ( 19. ip . and employing in addition the 
lower bound r ^ 2 to confirm that X cS M~ r ^ 3s ^ <C X~ s , we therefore obtain 
the estimate 

X ri a+r (i+i,e) < x (c+1)5 M~ 7 " (2s " r+1)rVs (X^ 
+ X Vs+r ~ s , 

whence 

X ns+r(r/s+(i)+(l-r/s)b)e) ^ x {c+l)8 j^--j-(2s-r+l)rh/s^ b kb+h (X)Y^ '■ 
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The desired conclusion now follows on raising left and right hand sides here to 
the power s/r. □ 

Our final task is to analyse the growth of the parameters as the iteration 
proceeds, since from this we are able to extract the conclusion of Theorem 12.11 

Lemma 9.2. Put s = rk. Then one has r] s+r = 0. 

Proof. We may suppose that r] s+r > 0, for otherwise there is nothing to prove. 
We begin by defining three sequences (a n ), (b n ), (h n ) of non-negative integers 
for ^ n ^ N. We put oq = and bo — 1. Then, when ^ n < N, we fix 
any integer h n with ^ h n ^ (k — l)b n , and then define 

a-n+i = b n and b n+1 = kb n + h n . (9.2) 

We next define the auxiliary sequences (ip n ), (c n ), (7„) of non-negative real 
numbers for ^ n ^ iV by putting ipo — 0, Cq — 1, 70 = 0. Then, for 
^ n < N, we define 

i[) n+1 = 0/r)V>n + (s/r - l)b n , (9.3) 
c n+1 = (s/r)(c n + l), (9.4) 
7n+ i = (s/r)7„ + (2s - r + l)h n . (9.5) 

It is apparent that 7„ is non-negative for n ^ 0, and an inductive argument 
shows that for ^ n ^ N, one has 

c n = ^^(s/rf - — < (2 + -i-) (s/r)" < 3(s/r)". 
s — r s — r \ k — 1/ 

We claim that a choice may be made for the sequence (h n ) in such a manner 
that for ^ n ^ N, one has 



6 n < VN(s/r) n (9.6) 

and 

j^+^i-HM) ^x c " s M-^[[K anK (X)}}. (9.7) 

When n = 0, the relation (19. 6p holds by virtue of the definition of bo, and the 
relation (19. 7p holds as a consequence of (14 .22 p . (I4.23P and Lemma [731 since 
the latter implies that 

X^- 5 < l[J s+r (X)}} « [[K 0tl (X)}}. 

Before considering larger indices n, we conduct a preliminary analysis of the 
recurrence relations (I9.2l) - (l9.5p . Observe that when n ^ 0, one has 

7 n+ i = (s/r)7„ + (2s - r + l){b n+1 - kb n ), 

whence 

7„+i - (2s - r + l)b n+ i = (s/r)(7„ - (2s - r + l)b n ). 
Then we deduce by induction that 

7„ = (2s - r + l)6n + (s/r)"( 7o - (2s - r + 1)6 ) 

= (2s-r + l)(6 n -(s/r) n ). (9.8) 
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Suppose next that the desired conclusions (19. 6p and (19. 7p have been estab- 
lished for the index n < N . Then from (19. 6p and (14. 14p one has kb n 9 < 
k(s/r) n ~ N ~ 2 < ~, whence b n < \(k9)~ l . By appealing to Lemma |9~T1 we de- 
duce from (19.71) that there exists a non-negative integer h, with h ^ (k — l)b n , 
for which one has the upper bound 

XVs+r (i+iP'e) < X C ' S M-^' lK a . tV (X)]] , (9.9) 

where 

a' = b n = a n+1 , b' = kb n + h, (9.10) 
if/ = (s/r)ip n + (s/r - l)b n = ip n+1 , 
c = {s/r)(c n + 1) = c n+u 

i = {s/r) ln + {2s-r + l)h. (9.11) 

Suppose, if possible, that b' ^ \fN{s/r) n+1 . The relations (EE]), fl9TT0|) and 
f l9~TTj) then show that 

V = (s/r) (2s - r + l)(6 n - (s/r) n ) + (2s - r + - kb n ) 
= (2s-r + l)(6 / -(s/r) n+1 ) 

> (l-l/v / iV)(2s-r + l)6 / . (9.12) 

But b' = kb n + h ^ (2 A; — l)6 n < and so it follows from Lemma [8.21 that 

[[K al)b ,{X)}}^X^ + \M b ') K . 

On substituting this estimate together with (19.121) into (19. 9p . we obtain the 
upper bound 

Xr) s+r {l+ip n+1 d) ^ ■gr ls+r +{c n+1 + l)S^b'^K-(2s-r+l)(l-l/VN) 

We recall that c n+1 ^ 3(s/r) n+1 , so that X^+ 1+l)S < M 1 / 2 . Also, 

K - (1 - l/v / iV)(2s -r + Vj^rk - (2rk - r + 1) + 2*/^ < -1. 
Thus we obtain 

j£r) s+r (i+ip n+1 6) ^ j^vs+r M l ~ b> <C X T,s+r M~ 1 

Since VVi+i and are both positive, we are forced to conclude that r) s+r < 0, 
contradicting our opening hypothesis. The assumption that b' ^ \/N(s/r) n+1 
is therefore untenable, and so we must in fact have b' < \/N (s/r) n+1 . We now 
take h n to be the integer h at hand, so that V = b n+ \ and 7' = 7«+i, and thus 
we confirm the upper bounds (19. 6p and (19.71) with n replaced by n + 1. 

We now collect together our various bounds on the parameters in question. 
We have (19. 6p and (19. 7p for ^ n ^ N, and also the bounds c„ ^ 3(s/r) n and 
7„, ^ 0. Also, by induction one finds that b n ^ k n and 

V>„+i = k *Pn + (k- l)b n ^ ki) n + (k - l)k n , 
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whence ip n ^ n(k — l)A; n_1 . Finally, one has &tv# < (r/s) 2 < |, so that 
6at < By applying Lemma 18.21 in combination with (j9.7j) . we therefore 
obtain the estimate 

Making use again of the relation 9 = N~ 1 / 2 (r/s) N+2 from f)4.14p . we conclude 

that 

rk y/Nrk{s/r) N+2 rk 4 

Vs+r ^^9^ N{k-l)k N -^ < 7F 
We may take N to be as large as necessary in terms of k and r, and thus 
Tj s _\_ r can be made arbitrarily small. We are therefore forced to conclude that 
r] s+r = 0, and this completes the proof of the lemma. □ 



The conclusion of Theorem I2.ll is an immediate consequence of Lemma I9.21 
for in view of (I4.2ip and (I4.24p the latter shows that when s ^ r(k + 1), then 
one has 

J S (X;F) ^x 2sd ~ K+e . 

10. Estimates of Weyl type 

The extraction of estimates analogous to that of Weyl is in general difficult, 
owing to the complexity of the multidimensional situation. Although it would 
be feasible, with extra space and care, to analyse directly the exponential sum 
f(ct; X; F) encoding quite general translation-dilation invariant systems F, we 
have chosen here to instead restrict attention to the Weyl sums associated with 
the system described in example (b) of $2j Such Weyl sums may be applied so 
as to bound the apparently more general sums f(a.; X; F) mentioned above, at 
the cost of somewhat weaker estimates. Since our upper bounds for J S {X; F) 
are essentially optimal with s = r(k + l), it transpires that the slight weakening 
of the Weyl exponent does not impede the bulk of applications. 

We stress then that throughout this section, until indicated otherwise, the 
system F is understood to be defined by 

F = (zl'zi 2 ...zj : 1 ^ |i| < k), 

and the corresponding Weyl sum f(ct; X; F) we abbreviate to /(ck). We define 
r and K as in (12.61) and (12.71) . We must also consider the system F' defined by 



F' = (zl 1 4 2 . . . z d d : 1 < |i| < Jfe-i; 



This system has rank 
and weight 



r> = ( k + d - l \-l. 



AT' = -i-(r' + l)(fc 

(3+1 



The hard work involved in estimating f(ct) has been presented by the first 
author in [9] §5], though here we take the opportunity to clarify one or two 
issues. 
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Theorem 10.1. Fix an index j with 2 ^ |j| ^ k, and put a = (2r'k)^ 1 . Let 
et £ W , and suppose that a and q are integers with q ^ 1, (a,q) = 1 and 
\qctj — a\ ^ q . Then one has 

\f(a)\ < X d+£ {q- 1 + X' 1 + qX^y. 

Proof. The conclusion of the theorem is essentially immediate from |2 Theo- 
rem 5.1]. We apply Theorem 12.11 to show that when s = r'k, then one has 

J 8 {X;F') ^X 2sd ~ K ' +A , 

with A ^ e. Since o = l/(2s), the upper bound for follows from the 

aforementioned estimate Theorem 5.1]. □ 

We add a few words of clarification to the proof of the latter conclusion 
in order to serve our purposes in the proof of Theorem 110.21 The index j = 
(ji? • • • ,jd) contains at least one coordinate ji satisfying ji ^ 1. By relabelling 
variables, if necessary, one may suppose that 1 = 1. At the top of page 24 of 
[9], it is asserted that there is no loss of generality in supposing that j\ ^ 1, an 
assertion that is made otiose given our relabelling of variables. However, it is 
apparent that the argument of this proof may be reorganised so that relabelling 
is unnecessary, with the central elements of the argument focused on the index 
ji instead of j\. The only issue to stress is that the set C [1, N] HZ prepared 
at the end of the proof of [HI Theorem 5.1] now depends on I. 

Our next estimate refines Theorem 110.11 so that many coefficients are ap- 
proximated simultaneously with control over a common denominator. In this 
context, when 9 £ K, we define = minl^ — y\. 

Theorem 10.2. Let v be a positive number, let A be a real number satisfying 
1 ^ A < X d , and write Q = X u (X d A- 1 ) 2r ' k . Suppose that \f(a)\ ^ A, and 
that Q <C X 1-25 for some 5 > 0. Let I be an integer with 1 ^ I ^ d. Then 
for each index j = (ji, • • • ,jd) with 2 ^ |j| ^ k and ji ^ 1, there exist a- s £ Z 
and gj £ N with {a^qy} = 1 and \qj(Xj — Oj| ^ QX S ~^. Moreover, the least 
common multiple q^ of the numbers q- } with 2 ^ |j| ^ k and ji ^ 1 satisfies 
qH ] < Q(logX) 2r ' fc and 

rfaj|| « Q 2 (\ogXr' k X 5 -U (2 < |j| < k, 3l > 1). 

Proof. We apply the argument of the proof of [9l Theorem 5.2]. Consider an 
index j = (ji, . . . ,jd) with 2 ^ |j| ^ k, and let I be any integer with j; ^ 1. 
It follows from Dirichlet's Theorem that there exist coprime integers q- } and a- } 
with 

1 < Qj < Q~ l X Ys \- 5 and Igjaj — aj | < QX 5 ~ lil . 
Keeping in mind the discussion following the proof of Theorem 110.11 we may 
apply the argument of the proof of [9], Theorem 5.2] to deduce from Theorem 
HO that gj < Q(logX) 2r ' fc < X 1 " 5 . Now fix an integer x £ [1,X], and 
suppose that there is an integer y £ [1,X] such that 

\\{k\) k aj (x - y)\\ ^ X 1 -® 
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for each index j with 2 ^ |j| ^ k satisfying ji ^ 1- Then the argument of the 
proof of [9j Theorem 5.2] following [9j equation (5.8)] shows that gj divides 
(k\) k dj(x — y) for each of the latter indices j. Write q$ for the least common 
multiple of the integers q- } with 2 ^ |j| ^ k and ji ^ 1. Then the argument of 
the proof of [9, Theorem 5.2] shows that 

g«<<Q(logX) 2 ''' fc . (10.1) 

Note here that our hypothesis ji ^ 1 permits the relevant part of the argument 
of P Theorem 5.1] to be applied successfully, reflecting the discussion above 
following the proof of Theorem 110.11 In addition, for 2 ^ |j| ^ k and ji ^ 1, 
one has 

hS^iW < q^H^W « Q(\ogXf r ' k (QX s -^) = Q 2 (\ogX) 2r ' k X 5 -^ . 
This completes the proof of the theorem. □ 

We remark that the statement of [91 Theorem 5.2] should be modified to 
reflect the argument concluding the above proof, so that the conclusion of 
Theorem 5.2] asserts only that go *C Q d (\ogP) 2sd . This issue follows through 
the work of [9j discussing Weyl sums. In particular, the conclusion of [9, 
Theorem 1.2] should be modified to impose a condition of the shape a -1 ^ 
|(<i + l)rk\og(rk). 

The next result is established via a Baker-style "final coefficient lemma" 
argument (see [21 Lemma 4.6]). 

Theorem 10.3. Let k be an integer with k ^ 2, and let t be a real number 
with r- 1 > (2r'k + l)(d + 1). Suppose that \f(a)\ ^ A ^ x d ~ T+u for some 
v > 0, and write Y = (X d A~ 1 ) k+U . Then there are integers aj and q satisfying 

(g,a) = l, l^g<F and \qa- 3 - Oj| < YX~ lil (K |j| < Jfe). 

Proof. This is immediate from the argument of the proof of [HI Theorem 5.5]. 
Write W = X 1 "^ 1 ^. We put s = r'k and note that r(2s + l)(d + 1) < 1, 
whence 

(X d A~ 1 ) 2S ( d+1 ^ > <C (X T ~ V ) 2S ( d+1 ^ <C xl-(d+l)r-2s(d+l)v _ j£-2s{d+l)u 

As in the proof of [HI Theorem 5.5], we may apply Theorem 110.21 to show that 
for 1 ^ I ^ d, there exist integers with the property that 

1 < gj < X u (X d A- 1 ) 2r ' k (\ogX) 2r ' k < w 1/{d+1) X- s », 

and satisfying the condition that whenever 2 ^ |j| ^ k and ji ^ 1, then 

llg^ajH « X 2 »(X d A- l ) Ar ' k (\ogX) 2r ' k X s -W « W 2 ^X S ~^. 

We take g to be the least common multiple of q^\ . . . , q^. In this way, we 
deduce first that 

l ^ g ^ ?? } • • • q ( d) « (wW+Vx-^y « w d ^ d+1 \ 
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Suppose next that 2 ^ |j| ^ k. Then there is some index I for which ji ^ 1, 
and we have 

Ml ^ ( J] io m) )ho l) ^\\ « (^ 1/(w) r w )rf-i W 2/( d+ i) X 5-|j|_ 

m^Z 

In this way, we find that whenever 2 ^ |j| ^ fc, then without any condition on 
ji, . . . , j d , one has 

||g «j|| < WX 5 -^. 

An application of [91 Lemma 5.4] completes the proof of the theorem, just as 
in the proof of 0, Theorem 5.5]. □ 

The conclusion of Theorem 11.31 follows at once from Theorem 110.31 as a 
special case. It seems worthwhile at this point to extract from Theorem 110.31 a 
conclusion that serves to estimate f(ct; X; F) for general translation-dilation 
invariant systems F. 

Theorem 10.4. Let F be a reduced translation- dilation invariant system of 
polynomials having dimension d, rank r and degree k. Define the exponent p 
by means of the relation 

^=^2k(^ + d d ~ 1 ^-2k + l ) ) (d + 1). 

Suppose that |/(a;X;F)| > A ^ x d ~^ +u for some v > 0. Write Y = 
(X d A~ 1 ) h+u . Then there are integers aj and q, satisfying 

(q, a) = 1, l<g<y and \qctj - aA < YX~ k > (1 ^ j ^ r). 

Proof. By assumption, the polynomials -P}(x) (1 ^ j ^ r) are homogeneous, 
and satisfy the translation-dilation invariance relation (12.31) for a suitable lower 
unitriangular matrix C(£). Write F for the column vector (Fj(x))i^<g r , and X 
for the column vector (x^i^m^, in which the entries are arranged in ascending 
colex order. Finally, put 

'k + cT 



The monomials x 1 span the space generated by {Fx, . . . , F r }, and so one may 
write F = SIX, with 21 an r x p matrix having integer entries depending only 
on the coefficients of F. The linear independence of the system F ensures that 
there exists an invertible p x p matrix 53, having rational entries depending 
only on the coefficients of the system F, having the property that 2153 is an 
r x p block matrix of the shape (O I r ), with O the r x (p — r) zero matrix, 
and I r the r x r identity matrix. Observe that 53 will have a block structure 
which associates to Fj monomials x 1 with |i| = kj. 

Suppose that \f(a; X;F)\ ^ A ^ X d -^ +v , for some v > 0. One has 

r 

f(a;X;F)= £ eQTo^x)) = £ e( £ (3 y 
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wherein we have written (3 = 2l T ai. Thus we have 



E < E ft* 



^ A ^ 

It follows from Theorem 1 1 . 3 1 that there exist integers Cj and v, with (v, c) = 1, 
satisfying 

and Iwjflj - Cj| < (1 ^ |j| ^ k). (10.2) 

Now <B T /3 = (2l«B) T a. Since 21<B = (O I r ), we see that 

(0,..., 0, ai,..., a r ) T = <B T /3, (10.3) 

so that ai,..., a r are given by linear combinations of the real numbers /3j 
(1 ^ |j| ^ k), with rational coefficients depending at most on the coefficients 
of F. 

Let Qq be the least natural number having the property that f2 *B has in- 
tegral entries, and let Q be the largest of the absolute values of the entries of 
Qq^B. Then it follows from (110. 2p and (110.31) that for 1 ^ i ^ r, one has 

\i\=h 

Write q = vQq and Z = pQY. Then we find that 1 ^ go Z and there 
exist integers 6j (1 ^ % ^ r) such that \qoOti — b{\ <C ZX~ fei (1 $C i ^ r). The 
conclusion of the theorem follows on putting g = (q , b), writing q = qo/g and 
di = 6j/<7 (1 ^ i ^ r), and noting that Z < 7. □ 

11. Asymptotic formulae associated with Diophantine equations 



The mean value estimate supplied by Theorem 12.11 may be routinely com- 
bined with estimates of Weyl type, explored in CTOl so as to establish asymp- 
totic formulae for the number of solutions of associated Diophantine systems. 
Such consequences have been examined already in the literature, and so our 
goal in this section is to sketch some conclusions, and outline the arguments 
necessary for their proofs. In particular, we deliberately avoid going into detail 
concerning proofs of these new results so as to avoid adding bulk to an already 
lengthy memoir. 

We begin by establishing an asymptotic formula for a general counting prob- 
lem of which Theorem ll.4l is essentially a special case. Consider then a reduced 
translation-dilation invariant system of polynomials F having dimension d, 
rank r, degree k and weight K. Let s be a natural number, and consider fixed 
non-zero integers c^- for 1 ^ i ^ r and 1 ^ j ^ s. Finally, let N s (X;F;c) 
denote the number of integral solutions of the Diophantine system 



E 



CijFfa) = (1 < i < r), (11. 1] 



with 1 ^ x ^ X . For the time being, it is convenient to abbreviate f(ct; X; F) 
to /(a). 
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Theorem 11.1. Suppose that s ^ 2r(k + 1) + 1. Suppose further that Cij 
(1 ^ i ^ r, 1 ^ j ^ s) are non-zero integers, and that the system ( fjj.jp has 
both a non-singular real solution, and a non-singular p-adic solution for every 
prime p. Then there exist positive constants *3i = S)(s, F, c) and v = u(s, F, c) 
such that 

N S (X; F; c) = @X sd ~ K + 0{X sd ~ K - u ). 

Proof. We begin by denning a Hardy-Littlewood dissection. When < 9 ^ 1, 
let DJIq denote the union of the boxes 

Wl e (q,a.) = {a G [0,l) r : {gat-ml^X*-** (1 < % < r)}, 

with 1 ^ q ^ X e , ^ a ^ q and (g, a) = 1. Complementing the major arcs 
DJlg, we define the minor arcs xxig = [0, l) r \ DJIq. We claim that whenever 
s ^ 2r(k + 1) + 1, then for a positive number S = S(s, F), one has 

/ \f{(3)\ s dp<^X sd - K ~ s . (11.2) 

Jm 1/3 

Define fj(ct) to be f{c\ ). Then from the upper bound (111.21) it 

follows by means of Holder's inequality that 

n/i(«)d«<n(/ i/»rd«) i/s 

m 1/2 J = 1 3 - =1 Vm 1/2 

< max / |/( / 3)| s d/3<X sd - x - <5 . (11.3) 
In order to confirm the estimate (111.21) . observe first that 

(\ s-2r(fc+l) 
sup |/(a)|) /|/(a)| 2 ^ +1 )da. (11.4) 

«e»i/3 y ^ 

By applying Theorem 12.11 one sees that 

j \f(a)\ 2r ^ da < x 2r ^~ K+£ . (11.5) 
Next, define a by means of the relation 

If we hypothesise that |/(ck)| ^ X d_CT , then it follows from Theorem 110.41 that 
there exist integers a« and q, with 

1 ^ q ^ X 1/4 , (g, a) = 1 and \qa { - a»| < X" fcl+1/4 (1 < i ^ r). 

Then one must have ck G SDT1/3, and thus we infer that 

sup |/(a)|a^. 

aem 1/3 

The desired estimate fl 11 . 2 [) follows on combining (111.41) and (111.51) . provided 
that we take 5 < a. In view of our earlier discussion, this confirms the estimate 
(HOD. 
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In order to estimate the contribution of the major arcs WI1/2, we may follow 
the argument of [HI §6]. We avoid providing many details here. We write 



and 



;(/3)= / e(Y\ mh))d~f, 



and then put 

V(a; q, a) = q~ d S(q, a)v(a — &/q). 

Define Vj(ct) to be 

V(cijO>i, . . . , c rj a r ; q, c^-ai, . . . , c rj a r ) 

when a G VJli/ 2 (q, a) C Wly2, and otherwise put Vj(a) = 0. Then by adapting 
the argument of [HI Lemma 5.3], one finds that 

sup |/ J (a)-V J («)|«X rf - 1 / 2 . (11.6) 
aean 1/2 

Since mes(9Jti/2) <C X^ r+1 ^ 2 ~ K , it follows via Holder's inequality that for some 
positive number u, one has 



/ /,-(<*) da- [ H^Hda 

Jm 1/2 j=1 Jsjix/ 2 j=1 

<X d - 1/2 max / \V j (a)\ s - 1 da + X sd - K - / . 



H/2 

Reversing course, one finds from (I11.6P via Theorem I2.ll that whenever s ^ 
2r(k + 1) + 1, then 

I |^-(a)| s - 1 da< / |/ J (a)| s - 1 da + X( s - 1 ^- !y 
Jtati/3 J mi/2 

< j \})(cx)\ s - 1 da + X { - s - l)d - R - v 



Thus we deduce that 



iV s (X;F;c)= / ] f [/ j (a)da+ / ]}£(<*) da 

= / TTv ? (a)da + 0(X sd - / ^ i/ ). (11.7) 
^1/2 j=1 

The argument of the proof of 0, Theorem 6.2] is readily adapted to show 
that 

/ TT Vj(a) da = 3GX sd ~ K + 0{X sd ~ K ~ v ), (11.8) 
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for some v > 0, where 

■/if J[o,i]^ y ~[ ~[ ' 

and 

oo s 

& = Y Y ( f sd X\S(q,c lj a l ,...,c rj a r ). 

q=l l^a^g j=l 
(<Z,a)=l 

The absolute convergence of Z and © follows via the methods of [9j §§5 and 
6]. Here, the existence of non-singular real and p-adic solutions suffices to 
guarantee that 3 > and & > (see [S] and [H] for the necessary ideas). On 
substituting into (111.71) and (111.81) . the desired conclusion follows. □ 

Note that, in order to count solutions of the system (lll.ip with |x| ^ X, 
one may merely add together the contributions from the 2 d sectors accommo- 
dating the various constellations of signs amongst the d coordinates, and thus 
Theorem 11.41 is an immediate consequence of Theorem 111.11 corresponding to 
the special translation-dilation invariant system 

F = (x 1 : 1 < |i| ^ k). 

The special case of Theorem 111.11 in which s = 2r(k + 1) + 2 and Cy = (— iy 
(1 ^ j < s) delivers the following corollary. 

Corollary 11.2. Suppose thatt ^ r(fc+l) + l. Then there are positive numbers 
£ = €(t, F) and 5 = 8(t, F) such that 

J t (X; F) = £X 2td - K + 0(X 2td ~ K - 5 ). 

We remark that the positivity of the product of singular integral and singular 
series, giving £ > 0, is in this case a consequence of the lower bound provided 
by Theorem 13.11 Note that Theorem 11.51 is a special case of Corollary 111.21 
corresponding to the same special choice of system as above. 

We now move on to consider Theorem 11.61 Here we may follow the sketch 
provided at the end of [HI §6]. We write 

9j{<*) = Y e (X^' aix1 )- 

\x\^X \i\=k 

Then it follows from Theorem 11.11 that whenever 

o((^> +1 >, 

then 

f\gj(a)\ 2s da<t:X 2sd - L+£ . 

The argument concluding [9j §6] then suffices to prove Theorem 11.61 

Finally, we consider the translation-dilation invariant system F, as in the 
preamble to Theorem 111. If with applications in additive combinatorics in 
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mind. We therefore consider non-zero integers ci,...,c s satisfying the con- 
dition ci + . . . + c s = 0, and we investigate the solubility of the Diophantine 

system 

ciF(xi) + . . . + c s F(x s ) = 0. (11.9) 

We define projected and subset-sum solutions in a manner transparently anal- 
ogous to that in the preamble to Theorem 11.71 

Theorem 11.3. Suppose that s ^ 2r(k + 1) + 1 and s > K + d 2 . Let a (1 < 

i ^ s) be non-zero integers satisfying c\ + . . . + c s = . Suppose further that the 
system of equations hi 1.9) possesses non-singular real and p-adic solutions for 
each prime number p. Let stf C Z d fl[l, N] d , and suppose that the only solutions 
of the system $11.9) from stf are either projected or subset-sum solutions. Then 
one has 

card(^) < iV^loglogiV)- 1 ^- 1 ). 

The conclusion of Theorem 11.71 is a special case of Theorem lll.3[ as we now 
confirm. For in the special circumstances relevant to the statement of Theorem 
ll.7l one has rk ^ K, and when k ^ 2 one has in addition 

(A _i_ h\ 

2r + 1^2f J - 1 ^ (d + 2)(d+ 1) - 1 > d 2 . 

Thus we find that the hypothesis s ^ 2r(k + 1) + 1 already ensures that 
s > K + d 2 , and Theorem 11.71 consequently follows as a direct corollary of 
Theorem 111.31 

The proof of Theorem 111.31 follows by adapting the methods of [llj to this 
more general situation wherein d may exceed 2. The conclusion of Theorem 
II 1.11 may be employed to show that when s ^ 2r(k + 1) + 1 and s$ is of suitable 
linear uniformity with relative density S, then the number jV{s$} of solutions 
of (I11.9P with xG/ satisfies 

> c 5 s N sd ~ K . 

We claim that the number ^(srf) of projected solutions is 0(N s<yd ~ l "> +d2 ), and 
that the number <A\(srf) of subset-sum solutions is 0(N sd ~ K ~ 1 l s ). Granted 
this claim, one finds that jV(srf) > cjVq(&&) + jy{{s^) provided that 

and this relation is satisfied whenever s > K + d 2 and 5 ^> (logiV)^ 1 , for 
example. Under such circumstances, we conclude that the system (111.9P con- 
tains solutions from s$ that are neither projected nor subset-sum solutions. In 
particular, if the only solutions of (I11.9P from s4 are either projected or subset- 
sum solutions, then card(<g/) <C iV d (log iV) _1 . On the other hand, if srf is not 
of suitable linear uniformity, then a concentration argument motivated by that 
of Roth [12J and described in the proof of [HI Lemma 5.3] may be adapted 
to this potentially higher dimensional setting to show that in the absence of 
non-trivial solutions, a suitable sub-progression can be obtained with higher 
relative density than that of srf . By iterating this argument, one deduces that 
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in circumstances wherein the only solutions of the system fill. 9p from srf are ei- 
ther projected or subset-sum solutions, then card(«g/) <C iV d (loglog N)^^ 8 " 1 ' . 
We refer the reader to [11] §5] for details. 

We have still to justify our earlier claim. We begin by counting the number 
of projected solutions ,jV^s^\ Let x = (xi, . . . , x s ) be any projected solution 
counted by Then there exists a translate aGR d with the property that 

span M {xi — a, . . . , x s — a} is a vector space of dimension m < d. There is no loss 
of generality in taking a = x 1; and then span R {x! — a, . . . , x s — a} has a basis of 
the shape S3 = {x ix — x 1; . . . , x im — xi} for some indices 1 < i\ < . . . < i m ^ s. 
Since —N < x i; — x x < N for 1 ^ I ^ m, one sees that the number of possible 
choices for S3 is 0((N d ) m ). But there are trivially 0(N d ) possible choices 
for Xi, and so we find that the number T of possible translated vector spaces 
X! + span R ^ satisfies 

T < N d (N d ) m ^ N d (N d ) d ~ 1 = N d2 . 

The integral vectors lying in V = span M ^ form an integral lattice of dimension 
m, and hence volume considerations confirm that the number of vectors y e Z d 
such that -2N < y < 2N and y € V is at most 0(N m ). Thus we deduce 
that for each index i with 1 ^ i ^ s, and each m < d, the number of possible 
choices for Xj — xi is at most 0(N m ). It follows that for fixed xi and S§, the 
number of possible choices for xi, . . . , x a is at most 0(N ms ). Then the total 
number of possible projected solutions xi, . . . , x s is 

JYq{s^) < N ms T < N ms+d? ^ N {d ~ 1)s+d2 . 

We turn next to the task of bounding ^Ai(^/). The number of partitions 
{1, . . . , s} = J?\ U . . . U flx, with / ^ 2 and the sets J? v disjoint and non-empty, 
is plainly O s (l). Let ,yV 2 (N; ^fli, . . . , J?i) denote the number of solutions of the 
system 

c u F(x u ) = (1 ^ v ^ I), 

with 1 ^ x ^ N. Then it follows that there exists a partition {1, . . . , s} = 
J?\ U . . . U of the aforementioned type, for which 

^(^)«^ 2 (iV; £ /i,..., £ //). 

Write m v = card( t? ^„) (1 ^ v ^ I), and note that 

mi + . . . + mi — s. (11.10) 

Finally, let n denote the number of the sets ^ v with 1 ^ v ^ I satisfying the 
property that caxd(^ v ) = 1. 

Our next step is to consider the contribution of the subset-sum equations 
defined by sets ^f v , distinguishing two cases. Suppose first that caidi^^) = 1 
and JP V = {u}. Since we may suppose c u to be non-zero, it follows from Lemma 
15.21 that the number of solutions of the system of equations c u F(x M ) = 0, with 
1 ^ x M N, is 0{N d ~ 1 ). 
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Suppose next that caid{^ v ) > 1. Abbreviating /(a; N; F) to /(a), a trivial 
estimate yields 



j\f(ct)\ 2 dcx^N 2d - 1 . 



On the other hand, a trivial estimate in combination with Theorem 12 . 1 1 delivers 
the bound 

\f(cx)\ s &ct<^N sd - K+£ . 



Then it follows from Holder's inequality and a change of variable that 

// r \ (m v -2)/(s-2) / C \ (s-m v )/(s-2) 

l/Mrda^y |/(a)| s da) U |/(«)| 2 da 

^ (A[ s d— K+£\(m v — 2)/(s— 2) ^ jy2d-l^ (s-m„)/(s-2) 
j\jm v d—m v K / (s-n)~v v +e 

where 

m„ — 2 \ „ / m„ \ „ s — m„ 



- K + 



s — 2 y \s — n J s — 2 

From here, a consideration of the underlying Diophantine system, followed 
by an application of Holder's inequality, reveals that 



^ 2 (iV; ^ . . . , J x ) « (A^- 1 )" J] I J] 1/(^)1 da 

card( < /„)>l 

« iV^-D" JJ J] U\f( Cu a)\ m v dcx 

card(^„)>l 

jV^ -1 )™ Jj£ j^m v d-m v K/(s-n)-u v +e 
card( c /«)>l 

We therefore deduce from (jll.iop that 
where 



v = n 



card(^„)>l 



On recalling (lll.iop . we see that 



s-2 / s-2 

A modicum of computation reveals that 

(s - 2)v = n(s - 2) + (2 - 21 + n)K + s(Z - n) - (s - n) 
= (s-2K)(l-l)+n(K -1). 
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Our hypothesis on s ensures that s ^ 2rk + 1 ^ IK + 1, and we may suppose 
moreover that / ^ 2. We therefore deduce that v ^ l/(s — 2), and thus 

This completes the proof of our earlier claim, and hence the proof of Theorem 
111.31 is now complete. 
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